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Preface 


The  research  and  writing  that  went  into  this  thesis 
was  performed  to  assist  those  who  are  required  to  perform 
trend  analysis . If  any  of  the  readers  of  the  guide 
(Chapter  IV)  find  it  a useful  tool,  my  efforts  will  have 
been  worthwhile. 

The  guide  portion  of  this  thesis  was  written  with 
great  detail  and  extensive  explanations.  This  made  it 
quite  long,  but  hopefully  the  guide  itself  can  be  extracted 
from  the  main  body  of  the  thesis  so  it  will  not  look  too 
unwieldy.  Also,  some  parts  may  seem  to  be  over-explained 
or  repetitious.  Hopefully  the  user  will  not  feel  his 
intelligence  is  insulted  but  realize  I had  to  avoid  inade- 
quate explanations.  The  guide  is  intended  for  those  with 
very  little  mathematical  background. 

I wish  to  express  my  gratitude  to  the  original  sugges- 
tor,  Major  Michael  Dumiak , and  Colonel  Ronald  Luhks,  who 
offered  it  as  a thesis  topic.  Both  were  very  helpful  in 
getting  the  research  started.  Also,  I want  to  thank  my 
advisor,  Major  Saul  Young,  because  sessions  with  him 
tended  to  ease  frustration  and  put  me  more  at  ease  so  I 
could  write  more  productively. 

Finally,  my  most  heartfelt  appreciation  and  love  are 
extended  to  my  wife,  Sandy,  for  her  understanding,  per- 
sonal sacrifices,  and  moral  support  gladly  given. 


David  A.  Bruns tetter 
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Abstract 

This  research  effort  was  directed  toward  offering  a 
practical,  effective  guidebook  for  the  application  of 
statistical  trend  analysis  methods.  Standardization/ 
Evaluation  offices  throughout  the  Air  Force  are  required  to 
do  trend  analysis  on  measurements  taken  over  time,  but 
those  doing  it  often  have  no  mathematical  background. 
Chapter  IV,  The  User's  Guide,  is  written  for  them.  Ten 
trend  description/trend  evaluation  methods  are  offered, 
nine  dealing  directly  with  trend  analysis  and  one  with 
correlation.  They  are  discussed  in  detail,  then  are  pre- 
sented in  step-by-step  guidesheet  form.  The  significance 
of  trends  are  evaluated  using  the  z-statistic  for  large 
samples  and  the  t-statistic  for  small  samples  from  normal 
populations.  Three  more  easily  applied  non-parametric 
methods  are  offered  for  use  when  time  is  limited  or  assump- 
tions for  other  methods  cannot  be  met. 
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A GUIDE  FOR  THE  APPLICATION  OF 
TREND  ANALYSIS  AND 
OTHER  STATISTICAL  METHODS 

I . Introduction 
Statement  of  the  Problem 

Management  directives  throughout  the  operational 
commands  require  staff  agencies  to  monitor  training  pro- 
gram effectiveness  through  trend  analysis.  Unfortunately, 
little  or  no  ' .idance  is  given  as  to  how  to  measure, 
describe,  or  even  recognize  such  trends.  In  fact,  quite 
often  higher  levels  of  management  do  not  define  what,  their 
view  of  trend  analysis  is,  or  what  they  expect  of  it.  Even 
when  the  concept  is  adequately  described,  those  made  respon- 
sible for  its  use  are  r.ot  chosen  for  their  mathematical 
ability  or  background,  but  their  expertise  in  the  specific 
type  of  performance  that  is  being  measured.  That  is,  those 
involved  in  trend  analysis  need  a primer  on  statistical 
methods,  especially  when  it  comes  to  declaring  when  a trend 
is  significant  in  a statistical  sense  and  when  inferences 
can  or  cannot  be  made.  The  expertise  should  be  applied  to 
determining  why  there  is  a trend,  not  in  trying  to  deter- 
mine if  one  exists  or  not. 
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Objective 

The  ultimate  objective  of  the  thesis  is  to  write  a 
guide  or  primer  that  will  assist  those  responsible  for 
trend  analysis  and  related  statistical  methods.  This  guide 
should  be  flexible  enough  to  be  used  widely,  but  not  so 
general  that  its  generality  detracts  from  its  usefulness. 

It  should  also  be  written  in  a step-by-step  manner  easily 
understood  by  the  unsophisticated  (in  math)  but  at  the  same 
time  well  substantiated  by  references  and  discussion  in  the 
text.  This  substantiation  should  be  put  in  a logical  man- 
ner for  the  user  so  that  he  might  have  some  confidence  in 
the  methods,  if  only  from  a conceptual  point  of  view. 

Background 

An  aircrew  member  at  a base  in  Germany  originally 
suggested  a guide  of  this  sort  through  the  Air  Force  Sugges- 
tion Program.  As  wing  radar  officer,  one  of  his  responsi- 
bilities was  the  development  and  maintenance  of  a trend 
analysis  (TA)  program.  He  did  not  have  a lot  of  time  to 
spend  at  it,  since  TA  was  not  his  primary  job.  Furthermore, 
since  he  didn't  know  much  about  what  TA  was,  (History  major) 
he  was  spending  more  time  than  he  could  afford.  The  sugges- 
tion was  that  a publication  or  manual  be  written  on  what  TA 
was  and  "how  to  do  it".  It  would  save  many  man-hours, 
since  so  many  flying  organizations  had  such  an  additional 
duty  for  someone.  The  suggestion  came  through  channels  and 
ended  up  as  a possible  thesis  topic. 
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Subsequent  conversations  between  the  suggestor  and  the 
writer  allowed  the  general  topic  to  be  narrowed  down  some- 
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what.  One  of  the  most  common  requests  by  commanders  was 
for  TA  officers  to  be  able  to  say  when  changes  in  aircrew 
performance  scores  (such  as  ratio  of  targets  acquired  to 
targets  attempted  on  radar)  became  significant.  Also  of 
interest  was  the  amount  of  correlation  between  failures  to 
acquire  and  weather  factors,  equipment  malfunctions,  and 
so  on.  It  was  nentioned  that  while  there  was  a USAFE  Regu- 
lation requiring  TA,  there  was  no  guidance  about  how  to 
develop  suc’o  a program  (Ref  9:1-2). 
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Scope 

I will  limit  my  coverage  to  types  of  trend  analysis 
and  statistical  methods  mentioned  by  the  original  requestor 
and  chose  I have  seen  used  in  my  own  operational  experien- < 
as  aircrew  member  and  evaluator.  This  will  include  var 
types  of  time-series,  regression,  and  correlation  ana  /s'is> 
plus  non- parametric  methods  and  a thorough  discuss-'  1 
statistical  significance  for  each  method. 


Anticipated  Problems  and  Limitations 

A combination  of  the  aoove  mentioned  - iversation, 
subsequent  ones,  and  preliminary  researc’  Air  lorce 

manuals  pointed  out  a possible  problem  A statistical 
treatment  was  lacking  and  needed,  b\  more  futidamental  was 
the  lack  of  proper  guidance.  The  titer  decided  to  do  a 
scries  of  interviews  with  those  evolved  in  TA  to  get  an 


".a^a  about  how  widesprsad  this  facet  of  the  problem  was . 

Two  weeks  were  set  aside  for  this  series  of  telephone 
interviews.  Besides  the  motivation  mentioned  above,  the 
writer  felt  these  conversations  would  be  a last  chance  to 
find  out  that  the  problem  was  isolated  (not  worth  a thesis 
effort).  Also,  they  could  be  a source  for  problem  types, 
data,  and  references. 

Face- to' face  interviews  would  have  been  more  useful, 
but  there  was  no  time  for  that.  Although  the  interviews 
were  kept  unstructured  in  crder  to  keep  them  informal, 
answers  to  toree  specific  questions  were  sought  from  each 
interviewee.  Guidance  wac  never  mentioned  in  a question, 
since  I wished  to  see  if  the  interviewee  would  mention  lack 
of  guidance  on  his  own. 

The  first  question  was  what  their  own  definition  of  TA 
was.  There  were  answers  all  the  way  from  strictly  "records 
keeping"  to  simply  identification  of  this  month’s  weak 
area,  all  the  way  to  a comparison  over  tune  to  a set  air- 
crew performance  sf&idard,  In  this  last  office,  the 
officer  mentionea  that  when,  performance  measures  dropped  by 
37c,  it  was  identified  as  a trend. 

The  next  question  was  about  whether  each  felt  what  he 
was  doing  appropriate  or  not.  Nearly  everyone  interviewed 
was  a little  unsure  as  to  the  value  of  what  they  were  doing 
There  were  complaints  that  commanders  tended  to  "overreact" 
to  what  they  felt  were  insignificant  trends,  but  they  had 
no  real  basis  on  which  to  argue.  One  officer  said  he  was 
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fairly  happy  with  his  program  even  though  he  throws  away 
all  past  data  except  the  current  time-period  and  one  time- 
period  back.  He  said  he  did  this  because  the  rules  changed 
so  often. 

Lastly,  they  were  asked  about  what  they  would  want  a 
guide  to  do  for  them  if  they  could  have  one  made- to-order . 
Number  one  priority  was  a good  definition,  some  guidance 
about  what  TA  was  and  what  was  expected  of  it.  Number  two 
was  something  on  inferences , about  when  a trend  or  change 
becomes  significant.  Number  three  was  something  on  sample 
sizes  and  what  sample  sizes  were  reasonable  when  making 
inferences . 

This  discussion  of  anticipated  problems  (suggestor) 
and  confirmed  problems  (interviews)  leads  directly  to  the 
limitations  of  this  research  effort.  This  paper  will  not 
address  the  lack  of  guidance  issue  as  such,  it  will  have  to 
be  worked  out  between  the  user  and  his  boss.  It  is  felt, 
however,  with  the  statistical  definition  of  TA  which  will 
be  addressed,  the  user  will  be  better  equipped  to  come  to 
an  agreement,  or  at  least  some  common  ground  upon  which  to 
argue . 

This  paper  will  address  the  statistical  side  of  the 
question  and  provide  a tool  for  the  user  along  with  some 
idea  about  what  the  methods  can  and  cannot  do. 

Methodology 

In  addition  to  the  literature  search  of  Air  Force 
publications,  a DDC  and  Rand  bibliography  search  was 
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conducted.  Business  publication  indexes  were  also  reviewed 
since  I was  interested  not  only  in  whether  the  military 
already  had  such  a guide,  but  whether  there  was  something 
in  the  civilian  sector.  None  of  these  sources  treated  TA 
the  way  I felt  it  needed  to  be  treated.  As  I viewed  it, 
the  TA  officer  is  there  as  an  expert  crew-member,  not  an 
expert  mathematician,  and  may  not  understand  a sophisti- 
cated treatment.  What  he  needs,  then,  is  a translation  of 
these  sophisticated  methods  into  something  easy  to  apply. 

A wealth  of  material  was  found  in  various  professors' 
libraries,  the  Engineering  School  library,  the  Logistics 
School  library,  and  the  Wright  State  University  library. 

It  was  determined  that  the  methods  found  in  these  statis- 
tics and  business  texts  were  applicable,  particularly  those 
dealing  with  time-series,  straight-line  or  curve-fitting 
and  correlation.  Those  statistics  texts  written  for  busi- 
ness students  were  particularly  useful,  and  are  excellent 
references  for  the  user  of  the  guide. 

A sufficient  number  of  different  methods  are  discussed  in 
preliminary  chapters  to  give  the  user  a feel  for  each  method 
if  he  has  the  time  and  the  inclination  to  read  them.  Also, 
a3  much  discussion  of  statistical  inference  as  possible  due 
to  space  and  time  is  given.  The  guide  itself,  however, 
should  stand  alone  and  be  usable  to  one  who  has  not  seen 
the  remainder  of  the  paper.  Also,  each  method  is  self- 
contained  with  a minimum  of  references  to  other  methods. 
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An  example  is  given  for  each  method,  following  each  step 
from  the  guide.  Assumptions  necessary  for  each  method 
precede  the  first  step.  Cautions  and  notes  on  application 
follow  the  last  step  so  there  is  less  chance  of  mistakes. 
The  bibliography  is  divided  up  into  those  references  used 
for  method  translation  and  those  that  are  especially  use- 
ful to  the  user  of  the  guide. 


II.  Tests  of  Hypotheses 


Levels  of  Measurement 

Measurement  is  the  process  of  assigning  numbers  to 
objects  or  observations.  The  kind  of  measurement  which  is 
achieved  is  a function  of  the  rules  -under  which  the  numbers 
were  assigned.  The  operations  and  relations  employed  in 
obtaining  the  scores  define  and  limit  the  manipulations 
and  operations  which  are  permissible  in  handling  the  scores; 
the  manipulations  and  operations  must  be  those  of  the  numer- 
ical structure  to  which  the  measurement  is  compatible 
(Ref  6:29). 

Nominal . The  nominal,  name  only,  or  classificatory 
scale  of  measurement  is  measurement  at  it's  weakest  level. 
Numbers  may  be  assigned  to  these  classes  or  categories,  but 
they  serve  as  nothi.ng  more  than  symbols.  For  example,  if 
there  are  only  two  categories  the  measurements  may  fall 
into,  the  measurements  or  "responses"  may  be  symbolized  by 
yes/no,  plus/minus,  3.7/Z3  , etc.  Aircraft  tail  numbers 
are  nominal,  as  long  as  the  effect  of  when  or  where  they 
were  produced  is  negligible.  Due  to  the  nature  of  nominal 
data,  the  only  descriptive  statistics  that  may  be  used  are 
frequency  counts  and  the  mode.  Tests  of  hypotheses  (covered 
in  the  next  section) , regarding  the  distribution  of  cases 
among  categories  can  be  accomplished  using  some  non- 
parametric  methods,  and  those  based  on  the  binomial  expan- 
sion (Ref  6:23).  These  tests  are  appropriate  because  they 
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focus  on  the  number  of  measurements  in  each  category,  no 
matter  what  the  "name"  of  the  category  itself  is.  The 
mean  and  standard  deviation  cannot  be  generated  for  nominal 
data.  For  instance,  it  dees  not  make  sense  to  say  what  is 
the  average  (mean)  of  a yes  and  two  no  answers. 

Ordinal.  This  higher  level  of  measurement  implies 
that  one  category  is  not  only  different  from  all  other 
categories,  but  that  they  can  be  arranged  or  ranked  "in 
order"  according  to  some  measurable  characteristic.  The 
system  of  grades  in  the  Army  is  an  ordinal  system.  One  can 
say  Sergeant>Corporal>Private  (where  the  symbol  ">"  means 
"greater  than").  If  the  Private  has  one  stripe,  the  Cor- 
poral has  two  and  the  Sergeant  has  three,  the  important 
thing  is  the  relative  number.  That  is,  the  same  relation- 
ship would  hold  if  the  Private  had  one  stripe,  the  Corporal 
four,  and  the  Sergeant  ten.  Appropriate  statistics  now 
include  the  median,  since  it  is  defined  as  that  point  along 
the  scale  where  there  is  an  equal  number  of  higher  and 
lower  ranking  measurements.  Hypothesis  testing  may  be 
accomplished  by  any  of  the  non-parametric  methods  based  on 
ranking  plus  those  allowed  for  nominal  data.  Again,  the 
mean  and  standard  deviation  cannot  be  used.  For  example, 
one  rank  severity  of  malfunctioning  equipment,  particularly 
if  you  are  speaking  of  its  effect  on  a system  or  perform- 
ance of  a job.  But  how  would  you  get  the  average?  The 
problem  is  defining  the  scale  between  levels  of  severity, 
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they  all  seem  to  be  different.  It’s  like  trying  to  define 
average  pain. 

Interval.  This  next  higher  level  is  everything  that 
the  ordinal  scale  is,  plus  there  is  a consistent  unit  of 
measurement  defined  between  the  values . Our  temperature 
scales  are  of  this  type.  The  units  of  measurement  may  be 
different,  and  may  be  measured  from  different  arbitrarily 
chosen  zero  points,  but  differences  in  values  (intervals) 
on  the  scale  have  meaning.  Also,  the  difference  between 
10°F  and  2n°F  is  the  same  as  the  difference  between  110°F 
and  120°F.  The  same  is  true  for  Centigrade.  All  the  previ- 
ously mentioned  statistics  may  be  used  plus  all  those  based 
on  the  mean  and  standard  deviation.  An  average  temperature 
makes  sense,  now. 

Ratio . This  is  the  highest  level  of  measurement.  A 
shortcoming  of  the  interval  scale  is  the  absence  of  a 
natural  zero  point.  Due  to  this  lack,  one  cannot  say  that 
20°F  is  twice  as  warm  as  10°F.  On  a ratio  scale,  however, 
one  may  say  that  two  inches  is  twice  as  long  as  one  inch. 
The  unit  of  measurement  again  may  be  picked  arbitrarily, 
but  now  a ratio  in  one  system  of  units  is  the  same  as  a 
ratio  in  any  other  system,  thus  the  name  ratio  scale.  For 
instance,  a length  of  rope  may  be  twice  as  long  as  another 
no  matter  what  unit  is  used.  All  previously  mentioned 
statistics  are  appropriate  here  including  the  geometric 
mean  and  coefficient  of  variation,  which  will  not  be  needed 
in  this  guide. 
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Hypothesis  Testing 

The  bulk  of  what  statisticians  do  can  be  regarded  as 

testing  hypotheses.  Webster  gives  the  definition  of 

hypothesis  as  the  following: 

....an  unproved  supposition  or  propo- 
sition (working  hypothesis)  tentatively 
accepted  to  explain  certain  facts  or  to 
provide  a basis  for  further  investiga- 
tion. 

The  user  does  not  know  what  the  true  "state-of-nature"  is, 
so  he  makes  a supposition,  then  checks  (tests)  whether  the 
data  available  supports  the  hypothesis.  This  hypothesis  is 
called  the  null  hypothesis , Hq,  and  he  either  accepts  or 
rejects  this  null  hypothesis.  When  he  rejects  Ho,  he 
rejects  it.  in  favor  of  some  alternative  hypothesis  H^. 

Then  he  takes  some  action  on  the  basis  of  whether  he 
decided  to  accept  or  reject  Hq.  We  now  have  two  states- 
of-nature  and  two  actions  possible.  If  these  are  arranged 
as  shown  in  Fig  1,  it  is  obvious  that  there  are  four  out- 
comes possible,  two  correct  decisions  and  two  incorrect 


^Action\ 

STATES  OF  NATURE 

S : H True  S, :H  Not  True 

o o I 0 

Accept  H 

Correct  Decision 

Incorrect  Decision 

o 

Type  II  Error 

Reject  H 

Incorrect  Decision 

Correct  Decision 

o 

Type  I Error 

- 

Fig  1.  Type  I,  Type  II  Errors 
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ones  (errors).  The  error  of  rejecting  a true  null  hypoth- 
esis is  called  a Type  I error,  and  its  probability  of 
occurrence  is  called  a.  Likewise  the  error  of  accepting  a 
false  null  hypothesis  is  called  a Type  II  error  and  its 
probability  of  occurrence  is  called  8 . 

Of  course,  there  is  more  to  the  "action"  than  just 


accepting  or  rejecting  R . For  instance,  let  us  say  our 


states-of-nature  are  whether  it  will  be  fair  (under  H ) or 


rain  (under  H^)  and  the  actions  whether  we  will  leave  our 

umbrella  (A  ) or  take  it  (A-,).  Our  null  and  alternative 
o i 

hypotheses  are: 


H : It  will  be  fair, 
o 


It  will  rain. 


Our  diagram  will  then  look  like  Fig  2. 

The  decision  here  would  be  made  on  the  basis  of  a 
weather  report.  In  statistics  we  don't  have  to  rely  on  a 
prediction,  usually,  but  instead  on  a sample  of  the  real 
but  undescribed  population,  which  fortunately  can  at  least 


ActionN\ 

STATES 

S : H True 

0 0 

OF  NATURE 

S,  : H Not  True 

I 0 

Accept  H, 

Correct  Decision 

Get  Wet 

o 

8=p (Getting  Wet} 

Reject  H 

Look  Silly 

Correct  Decision 

0 

a=p {Looking  Silly} 

Fig  2.  Umbrella  Example 
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be  quantified.  Actually,  when  we  say  "hypothesis"  in 

statistics,  we  mean  a frequency  distribution  (Ref  10:231). 

When  we  accept  or  reject  Hq,  then,  we  are  accepting  or 

rejecting  whether  our  sample  came  from  the  hypothesized 

population  distribution.  If  our  sample  mean  y is  too  much 

different  from  a hypothesized  population  mean  p,  we  rnt.^t 

reject  the  hypothesis.  Assuming  normality  the  tabulated 

value  of  the  test  statistic  z is  a measure  of  how  close 

a 

y and  p should  be  at  a given  a.  The  computed  value  of  the 
z-statis tic 


(from  our  data)  is  then  compared  to  the  tabulated  value  and 
the  hypothesis  is  accepted  or  rejected  accordingly. 

A word  here  about  measurement  scales.  To  use  Eq  (2-1), 
the  data  must  be  at  least  interval  (i.e.,  interval  or  ratio) 
since  the  mean  p and  standard  deviation  a are  involved. 
Testing  of  hypothesis  about  binomial  populations  can  usu- 
ally be  reduced  to  testing  some  hypothesis  about  the  param- 
eter p,  the  probability  of  occurrence  (Ref  2:101).  That  is, 
one  assumes  a distribution  with  parameter  p,  then  checks 
whether  the  sample's  frequency  of  occurrence  is  consistent 
with  the  p of  the  distribution.  These  binomial  tests  may 
be  done  with  ordinal  or  even  nominal  data. 

Since  we  are  dealing  with  distributions,  things  are 
more  complicated  than  our  umbrella  example  (Fig  2)  shows, 
but  we  may  use  it  to  illustrate  another  very  important  point 


We  have  said  nothing  about  which  type  error  we  most  wish  to 
avoid.  If  we  embarass  easily,  we  may  wish  to  avoid  looking 
silly  carrying  an  umbrella,  so  we  concentrate  on  keeping 
that  error  as  small  as  possible.  But  what  if  we  were  going 
directly  to  a job  interview?  If  we  got  wet  we  would  make  a 
bad  impression.  Normally  what  is  done  here  is  change  our 
Hq  and  states-of-nature  as  shown  in  Fig  3,  while  still 
keeping  our  level  of  significance  (probability  of  Type  I 
error),  a,  at  its  same  small  value.  In  this  context  a is 
also  called  the  critical  level,  where  we  reject  HQ. 


X 

STATES 

OF  NATURE 

ActioiX, 

S : H True 

O 0 

S^  Hq  Not  True 

(It  Will  Rain) 

(It  Will  Be  Fair) 

Accept  Hq=> 

A : Take 
Umbrella 

Correct  Decision 

Look  Silly 
p{Looking  Silly}=$ 

Reject  Hq=> 

A, : Leave 
Umbrella 

i 

Get  Wet 

p {Getting  Wet}=a 

..  . 

Correct  Decision 

Fig  3.  Ho,  Switched 


Manipulating  Ho  to  reduce  the  error  we  wish  most  to 
avoid  is  analogous  to  burden-of-proof  considerations  in 
criminal  justice.  A person  is  innocent  until  proven  guilty 
"beyond  a reasonable  doubt".  We  wish  to  avoid  punishing  an 
innocent  man  so  much  we  are  willing  to  risk  letting  a 
guilty  man  go  free.  The  Type  I error  is  the  one  we  control 
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to  a certain  specified  level  so  we  set  up  our  hypotheses 
so  that  the  burden-of-proof  is  on  the  prosecution.  Like- 
wise, if  we  embarass  easily,  we  want  to  leave  the  umbrella 
at  home  unless  we  are  convinced  (beyond  a "reasonable 
doubt")  that  it  will  rain.  We  could  have  just  as  well  set 
H up  so  that  we  would  take  the  umbrella  unless  we  were 
convinced  (beyond  a "reasonable  doubt")  that  it  will  be 
fair. 

Again,  with  statistics  we  are  somewhat  better  off  than 
the  job-hunter  and  the  weather.  With  our  sample  mean  y, 
population  standard  deviation  a,  and  our  formula,  Eq  (2-1) 
we  can  get  both  a decision  rule  (for  action)  and  compute 
the  probability  £ of  the  Type  II  error.  Remember,  we  have 
already  specified  what  we  want  a to  be.  A hypothetical 
example  will  illustrate  this. 

Suppose  a commander  had  a campaign  to  improve  emer- 
gency procedure  scores  in  his  wing.  He  wishes  to  throw  a 
party  if  the  overall  average  came  up  to  their  goal  of  90% 
Naturally,  as  mentioned  before,  we  are  dealing  with  a 
distribution,  and  the  90%  is  the  mean  of  a distribution. 

The  commander  is  hypothesizing  that  Hq:  y=90%  and  wishes  to 
test  this  hypothesis.  Liking  parties,  he  wants  the  risk 
of  rejecting  HQ  (when  it  is  actually  true)  and  not  throwing 
the  party  to  be  small,  say  a=.05.  To  test  the  hypothesis 
he  takes  a sample  of  9 crew  members  (n=9)  and  gives  them 
the  test. 
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Our  diagram  is  as  shown  in  Fig  4a.  It  can  be  depicted 
using  distributions  as  shown  in  Fig  4b.  Normally  in  statis- 
tics our  alternative  hypothesis  in  this  situation  would  be 
H^:p<907o  since  our  action  after  rejecting  Hq  and  accepting 
Fh  would  be  the  same  at  several  values  of  y<9C%  (action 
or  no  party).  However,  all  lower  values  of  y<90%  affect 
only  6,  and  have  no  effect  on  our  chosen  a.  The  commander 
is  only  interested  in  whether  he  should  have  the  party  or 
not,  not  how  bad  his  crewmembers'  averages  really  were.  If 
he  were,  he  might  well  have  used  a different  null  hypothesis 
The  alternative  hypothesis  H^:  y=87%  above,  was  picked 
strictly  for  illustration.  This  specific  value  will  enable 
us  to  compute  the  probability  of  a Type  II  error  8.  The 
straight  vertical-line  notation  P(A-JSo)  means  the  proba- 
bility of  not  throwing  the  party  (A^)  given  that  the  true 
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Populate  ^ mean  is  907>  true) . This  is  our  probability 

7 je  I error  a.  Here  a=.C  equals  the  probability 

(SQ).  This  is  csllec  the  conditional  probability  of 

</er  the  state-r^-nature  S„  exists. 

o 


The  comm  - wants  a value  y*  somewhere  between  87 
and  90  such  fcha  men  a sample  y is  taken  from  his  wing  and 
it  is  better  than  y*,  (y- y*)  he  will  give  the  party  and  if 
y<y*  he  will  not,  i.e. , a decision  rule . If  he  knows  from 
experience  that  c=37<>  and  he  plans  to  test  a sample  of  size 
n~  9 t he  can  compute  this  critical  value  y*  using  formula, 
Sq  (2-1)  as  follows: 


y*-y( 


a- . 05 


a//iT 


1.645  = 

3/ *9 


Solving  for  y* , y*  = 88 . 4 


This  says  if  his  sample  mean  yd88.4%  he  accepts  Hq  and 
gives  the  party,  but  if  y<88.470  he  rejects  Hq  and  does  not. 
Put  another  way,  there  is  only  an  a=.05  probability  that 
the  sample  came  from  a population  with  a mean  of  907>  if  the 
sample  mean  is  less  than  88.470. 

We  can  also  use  formula,  Eq  ('2-l)  to  compute  our  3 
associated  with  this  particular  alternative  hypothesis  or 
state-of-nature.  We  compute  z: 
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y-ws 

^ = i = 68.»  4-87  = ^ 4 

^ cr//rT  3//F 


The  area  tmder  this  part  of  the  lower  curve  in  Fig  4b 
associated  with  the  commuted  value  of  Z is  .081.  So  we 
have  g=.081.  We  can  plot  various  values  of  6 for  various 
alternative  hypotheses  H^.  They  will  look  something  like 
the  lines  :o  Fig  5.  When  the  curve  (called  an  Operating 
Characteristic  or  OC  curve)  is  high  near  the  value  Hq  and 
low  away  from  Hq  it  indicates  the  ability  of  the  decision 
rule  to  distinguish  between  the  hypothesized  state-of- 
nature  SQ  and  possible  alternative  states-of-nature 
associated  with  the  many  alternative  hypotheses.  The  value 
that  determines  the  steepness  of  the  curve  (quali ty  or 
power  of  decision  rule)  is  the  sample  size  n,  as  indicated 


in  Fig  5,  Operating  characteristic  curves  are  given  in 


most  statistics  books.  Normally  one  is  not  as  concerned 
about  the  value  of  6 as  he  is  the  value  of  a,  but  he  should 
realize  there  is  a tradeoff  between  sample  size  and  the 
risk  of  a Type  II  error  for  fixed  a.  For  a fixed  n there 
is  a trade  off  between  a and  6. 
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Ill . Methods  from  the  Literature 

Purpose  and  Overview  of  Chapter 

Purpose . The  purpose  of  this  section  is  to  present 
and  discuss  the  methods  the  author  feels  are  applicable  to 
measurements  handled  by  Air  Force  trend  analysis  and 
Standardization/Evaluation  offices.  These  methods  will  be 
presented  in  much  the  same  manner  as  in  representative 
textbooks.  If  the  reader/user  wishes  to  skip  this  chapter 
and  go  directly  to  the  "guide"  due  to  lack  of  time,  inter- 
est, or  thinks  he  may  not  have  sufficient  background  to 
follow,  he  may  do  so  here.  Certain  important  assumptions 
(sample  size,  types  of  measurements,  etc.)  will  be  repeated 
there,  to  re-emphasize  and  to  assure  all  are  considered  in 
gathering  and  analyzing  the  data.  It  is  highly  recommended, 
however,  that  this  section  be  reviewed  sometime,  because  it 
may  give  the  user  a better  "feel"  for  the  methods  than  just 
following  the  steps  of  the  algorithm. 

Time -Series . Data  classified  by  time  is  called  Time- 
Series  data  (Ref  7:18).  This  research  effort  does  not 
strictly  address  only  time-series  methods,  but  these  will 
be  the  methods  concentrated  on.  Although  the  word  "trend" 
makes  one  think  of  changes  over  time,  those  put  in  the  job 
of  trend  analysis  find  themselves  responsible  for  other 
types  of  statistical  manipulations,  such  as  correlation  of 
two  independent  variables.  It  is  possible  attempts  are 
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made  in  the  field  to  describe  correlation  of  more  than  two 


independent  variables,  but  these  methods  will  not  be 
covered. 

Overview.  Due  to  an  overwhelming  number  of  questions 
from  the  field  about  statistical  significance  of  trends,  a 
description  of  the  concept  of  control  charts  will  be  given. 
It  is  planned  to  use  the  control  chart  with  all  methods  in 
the  guide  except  correlation  and  non-parametric  methods . 
Straight  line  least- squares -best- fit  (LSBF) , the  most  visu- 
ally appealing  method,  will  then  be  covered.  The  LSBF 
method  will  be  extended  to  a non-linear  model,  the  fitting 
of  a parabola.  Correlation  between  two  variables  will  also 
be  covered,  with  emphasis  on  the  pitfalls  of  correlation 
vs.  causality. 

The  next  processes  covered  will  be  those  used  primar- 
ily in  the  business  world,  moving  averages  and  exponential 
smoothing.  These  methods  are  widely  used  in  business  fore- 
casting and  provide  a simpler,  quicker  method  of  reacting 
to,  or  computing,  new  data  points  in  the  model. 

Lastly,  non-parametric  methods  will  be  presented. 

These  are  the  most  general  and  sometimes  the  easiest  to 
apply,  but  as  a result  are  not  always  as  powerful.  Cox  and 
Stuart,  Kendall's  and  Spearman's  tests  for  trend  will  be 
addressed. 

Control  Charts 


Air  Force  Standardization/Evaluation  requires  crew 


Fig  6 . Control  Chart 

members  to  perform  at  some  given  standard.  They  are  tested 
periodically  against  that  standard.  This  process  is  quite 
analogous  to  manufacturing,  where  a quality-control  inspec- 
tor samples  machined  parts  to  determine  if,  on  the  average, 
they  are  within  tolerances . 

Let  us  say  we  wish  to  maintain  a standard  of  907, 
correct  responses  on  Emergency  Procedures  examinations.  We 
could  construct  a chart  like  the  one  shown  in  Fig  6.  The 
discontinuity  near  the  origin  is  strictly  to  save  space, 
and  not  to  exaggerate  small  vertical  differences.  In 
industry,  several  kinds  of  control  charts  are  used.  Qual- 
ity control  engineers  look  for  trouble  when  a sample  point 
falls  beyond  the  control  limits.  One  source  mentioned  that 
even  when  the  points  fall  between  the  control  limits , a 
trend  or  some  other  systematic  pattern  may  serve  notice 
that  action  should  be  taken  to  avoid  serious  trouble 
(Ref  4:421) . 
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Normally  control  charts  have  both  an  upper  control 
limit  (UCL)  and  a lower  control  limit  (LCL) . In  the  case 
of  test  scores,  however,  commanders  would  most  likely  be 
interested  in  scores  displaying  a trend  toward  the  LCL, 
unless  he  was  looking  for  some  policy's  positive  effect  on 
already  good  scores. 

Let  vis  say  he  is  interested  in  just  the  LCL.  If  there 
were  a known  population  mean  and  standard  deviation,  y and 


a,  we  could  establish  I,CL=y- 2 


a/n" 


The  value  z may  be 


thought  of  as  the  number  of  sample  standard  deviations 
from  the  mean  of  the  sampling  distribution.  The  parameter 
a is  the  probability  a sample  mean  would  lie  at  least  z 
standard  deviations  from  the  presumed  or  hypothesized  popu- 
lation mean,  or  below  the  LCL.  The  "n"  stands  for  the 
sample  size.  Here  we  are  assviming  that  y=90%  and  we  wish 
to  know  when  scores  begin  to  deviate  significantly  from 
this  standard  or  assumed  population  mean. 

We  very  rarely  know  the  population  standard  deviation 
for  sure,  but  if  our  sample  size  is  large  enough,  n>30,  we 
may  substitute  the  sample  standard  deviation  "s"  for  o 
(Ref  5:169).  Also,  if  n is  large,  the  distribution  need 
not  be  normally  distributed. 

If  n is  not  large,  but  the  assumption  of  normality  is 

s 

a reasonable  one,  we  may  use  the  expression  LCL=90-ta-pr 
where  t is  the  value  of  a random  variable  having  the 
Student- t distribution  with  the  parameter  v=n-l.  The  t- 
distribution  is  well  tabulated  and  widely  available. 
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The  parameter  "v"  is  called  Degrees  of  Freedom  (DF) . The 
parameter  a means  the  same  as  in  previous  treatments. 


In  order  to  have  straight  horizontal  lines  for  control 
limits,  the  quantity  standard  deviation  s must  be  assumed 
to  be  nearly  constant.  If  this  is  not  a reasonable  assump- 
tion, there  will  have  to  be  limits  computed  and  plotted  for 
each  time  period.  This  does  not  invalidate  the  use  of  the 
control  chart,  however,  but  it  does  make  it  a bit  more 
difficult  and  cluttered. 

We  now  have  a standard  with  which  to  compare  our  trend 
and  a value  (LCL)  where  our  measurements  or  observations 
(test  scores,  say)  start  to  become  significant  at  some 
level.  The  parameter  a,  in  this  context,  again,  is  called 
the  level  of  significance. 

Linear  Regression  or  Least-Squares-Best-Fit 

Many  times  a series  of  data  points  suggests  a straight 
line.  In  fact,  che  eye  is  known  to  attempt  to  see 
"patterns"  that  may  not  even  be  there,  so  a person  may 
easily  see  a line  in  a string  of  points  over  time.  In 
mathematical  terms,  this  says  the  relationship  between  x 
(the  independent  variable)  and  y (the  dependent  variable) 
or  the  regression  of  y on  x,  can  be  described  by  the  rela- 
tion y-B0*i-3-j_x  where  a is  the  y-axis  intercept  and  b is  the 
slope  of  the  straight  line.  In  statistical  terms,  we  say 
there  is  such  a line  that  best  fits  the  data.  Here  y is 
a point  on  the  line,  and  though  it  may  not  be  one  of  our 
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Fig  7.  Regression  Line  with  No  Error 

observations,  it  is  (or  assumed  to  be)  the  mean  of  the 
distribution  of  y's.  Actually,  it  is  the  mean  of  the  y's 
given  a value  of  x,  or  the  Expected  Value  E(y|x).  See 
Fig  7. 

In  general,  an  individual  observation  y,  since  it  is 
a measurement,  may  contain  some  error;  furthermore,  the 
true  but  unknown  relationship  may  not  be  exactly  linear, 
so  there  is  one  more  term,  e,  which  contains  all  the 
errors . That  is , e contains  both  random  errors  of  measure- 
ment and  systematic  errors  due  to  an  incorrect  relationship 
being  fitted. 

The  relation  can  now  be  described  by  the  equation 


y±  = E(y|xi) 


or  yi 


Bo+6lxi+£i 


(3-1) 


as  mentioned  above.  We  can  always  theoretically  choose 
BQ  such  that  the  mean  of  the  random  variable  e is  zero 
with  certain  assumptions  mentioned  later.  The  method  of 
least  squares  estimates  the  parameters  BQ  and  B-|_  by  mini- 
mizing the  vertical  distance  between  the  line  and  our 
points.  If  e,  a,  and  b are  the  estimators  of  our  s,  BQ, 
and  B^,  then  we  are  finding  the  values  that  minimize  this 
equation: 

l ei  = l (yi-Ca+bx.)) 2 (3-2) 

i=l  i=l 

n 

From  here  on,  £ or  just  S will  mean  summation  over  the  n 
i=l 

subscripted  arguments. 

Taking  the  partial  derivatives , setting  them  equal  to 
zero,  and  solving  them  simultaneously,  give  us  the 
estimators : 


a = y-bx 


and 


b = -22. 
^xx 


.0-  J) 


where  x and  y are  the  means  of  x and  y and: 


S 


xx 


nix?- (Ix^) 2 


(3-4) 


S = nix. *y . - (lx. ) (ly . ) 
xy  x J x s i' N J i/ 


(3-5) 


The  Gauss-Markov  Theorem  states  that  among  all  unbiased 
estimators  for  BQ  and  B]_  which  are  linear  in  the  y^,  the 
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least  square  estimators  a and  b have  the  smallest  variance 
and  are  the  most  reliable  in  the  sense  that  they  are  sub- 
ject to  the  smallest  chance  variation  (Ref  10:396).  This 
theorem  assumes  the  following  properties : 

1.  The  x^’s  are  fixed  (months,  for  instance). 

2.  The  y^'s  are  random  variables  (distri- 
bution not  specified) . 

3.  The  e are  independent  (not  necessarily 
normal)  but  Var (e)=Var (y) . 

These  should  be  reasonable  assumptions  for  trending  of 
such  things  as  test  scores  over  time. 

One  even  more  basic  assumption,  however,  is  that  the 
time  periods  are  independent  of  each  other.  Normally  one 
can  make  this  assumption  if  there  are  no  cycles  or 
seasonal- type  variations  over  time.  If  there  are  such 
cycles  over  time,  other  time-series  methods  may  be  used  to 
minimize  their  effect.  This  is  not  the  only  reason  why 
one  may  wish  to  use  other  methods.  Computing  parameters 
for  a new  regression  line  each  time  period  gets  tedious. 
The  section  following  correlation  deals  with  such  simpler 
methods,  starting  with  moving  averages.  In  addition,  the 
assumptions  underlying  these  methods  are  not  so 
restrictive . 

The  case  where  the  regression  of  y on  x is  linear 
leads  to  fitting  other  non-linear  curves  to  data.  For 
example,  some  functions  can  be  transformed  into  linear 
functions  such  as  y=a*bx,  which  can  be  transformed  into 
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log  y = log  a + x log  b by  taking  the  logarithm  of  both 
sides  of  the  equation.  Likewise,  y=a*xD  can  be  trans- 
formed into  log  y = log  a + b log  x,  again  by  taking 
logarithms . 

We  will  not  be  dealing  with  these  transformations  in 
the  guide,  but  will  be  using  a polynomial  regression,  that 
is,  the  fitting  of  a parabola  of  the  type  y=a+b‘x+c*x2 . 

The  method  is  the  same  as  for  the  straight-line  model, 
except  the  following  three  equations  must  be  solved  for  the 
three  unknowns  a,  b,  and  c: 


na+(£x)b+(Ex2) c = Ey 

(3-6) 

(Ex)a+(Ex2)b+(Ex3)c  = Exy 

(3-7) 

(Ex2)a+(Ex3)b+(Ex4)c  = Ex2y 

(3-8) 

Correlation.  In  correlation  analysis  both  variables 
are  random  variables  with  some  distribution.  Relating 
this  to  Fig  7 from  the  linear  regression  section,  we  say 
not  only  is  there  a distribution  of  y's  given  a value  of  x, 
but  there  is  also  a distribution  of  x's  given  a value  of  y. 
The  distribution  at  any  one  point  mrght  look  something 
like  Fig  8.  Such  distributions  are  referred  to  as  joint  or 
bivariate  distributions.  The  regression  analysis  of  y on  x 
(previous  section)  implied  that  y was  dependent  on  x. 
Correlation  analysis  treats  the  data  both  ways,  symmetri- 
cally, and  is  neutral  concerning  the  direction  of  the 
dependency.  For  our  uses,  this  means  it  is  up  to  the 
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decision-maker  to  speculate  on  the  cause-effect  relation- 
ship, as  the  numbers  just  give  the  magnitude  of  the 
co-relationship . 

For  such  distributions  the  correlation  coefficient  p 
is  defined  by  the  equation 


. 2 _ 


(3-9) 


where  a2  is  the  variance  of  the  conditional  distribution 
g2(y|x)  and  a22  is  the  variance  of  the  marginal  distribution 
f2(y).  What  is  meant  by  the  conditional  and  marginal 
distributions  is  shown  in  Fig  9.  The  formula  is  derived 
in  much  the  same  manner  the  parameters  were  for  simple 
linear  regression,  except  that  errors  are  minimized  from 
both  directions.  Note  that  p2  can  be  written: 


, 2 - 


= 1 - 


o£ 


02 


(3-10) 


We  know  a2  is  a measure  of  the  variation  of  the  y's  when  x 
is  held  fixed  while  a/  is  a measure  of  the  variation  of 
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Fig  9.  A Joint  Distribution  Frequency  Table 

the  y's  when  x is  not  held  fixed.  Our  p2,  then,  is  a 
measure  of  the  proportion  of  the  variation  in  the  y's  that 
can  be  attributed  to  a linear  relationship  with  x 
(Ref  1:323). 

The  correlation  coefficient  p may  be  estimated  by  the 
sample  correlation  coefficient 

Sxv 

r - S' (3-11) 

/ S • S 
xx  yy 

where  S , S , and  S,f„  are  defined  as  in  the  last  section, 
xy  y y 

The  null  hypothesis  Hq:  p=0  may  be  tested  using  the  z- 
statistic 


Sj 
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Several  cautions  are  appropriate  here.  In  addition 
to  the  same  assumptions  about  normality  and  independence 
(of  errors)  required  for  straight-line  LSBF,  one  must  be 
aware  that  correlation  coefficients  based  on  relatively 
small  samples  are  generally  not  very  reliable  (Ref  4:326). 
Also,  r is  a measure  of  the  strength  of  a linear  associa- 
tion between  the  random  variables.  As  illustrated  in 
Fig  10,  r could  show  up  as  close  to  zero  when  in  fact  there 
is  a surong  (but  nor.- linear)  relationship  (Ref  4:327). 
Thirdly,  •>c  mentioned  in  the  beginning  of  this  section,  a 
significant  correlation  does  not  necessarily  imply  a 
causal  relationship  between  the  variables. 


Types  Used  in  Business  Forecasting 

The  methods  in  this  section  will  be  particularly 
useful  when  there  are  periodic  fluctuations  over  time.  For 
instance,  if  low  visibility  weather  affects  visual  or  radar 
bombing,  then  it  is  conceivable  that  bombing  scores  may 
have  cyclical  or  seasonal  variations  over  several  years. 

In  our  linear  regression,  we  assumed  that  time  (independent 
variable)  was  also  independent  of  itself.  Another  way  of 
putting  this  is  "no  autocorrelation".  In  our  example 
above,  the  time  of  the  year  has  traits  that  are  definitely 
not  independent  of  whether  the  last  few  months  were  the 
end  of  Winter  or  the  end  of  Summer.  Our  LSBF  line  could 
indicate  an  erroneous  "trend"  if  it  started,  say,  in  the 
Spring  and  went  for  several  months . 

The  following  methods  tend  to  "dampen  out"  such  cycli- 
cal effects.  In  addition,  fewer  calculations  are  necessary 
when  a new  data  point  is  obtained,  and  some  methods  permit 
a forecaster  to  place  more  weight  on  current  data,  as 
opposed  to  equal  weighting  of  all  data. 

Although  no  specific  assumptions  are  necessary  for 
applying  these  methods,  when  they  are  used  with  control 
charts  to  test  significance  of  trends,  all  control  chart 
assumptions  must  be  satisfied. 

Moving  Averages . The  method  of  simple  moving  averages 
simply  takes  the  lavSt  N measurements,  computes  their  arith- 
metic mean  called  m^  and  this  arithmetic  mean  becomes  the 
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forecast  for  the  next  T periods.  As  each  new  data  point 
is  acquired,  it  is  figured  in  while  the  oldest  one  is 
eliminated.  A small  number  N (say  3 to  5)  will  respond 
more  quickly  to  large,  permanent  changes  in  the  response 
variable  than  larger  N's  (more  than  10),  but  will  also 
respond  too  quickly  to  anomalies  or  noise  (Ref  8:88).  The 
moving  average  formula  for  time  period  x^  is : 

N 

l y± 

Mi  - i=V~  (3-13> 

yi“yi-N 

or  + (short  formula)  (3-14) 

where  the  y^'s  are  the  observed  responses  at  time  period 
x^.  The  short  formula  may  be  used  after  computing  the 
first  M^. 

Double  moving  averages  improve  the  response  to  changes 
in  trend  while  keeping  the  risk  of  responding  to  an  out- 
lier small.  Here,  two  moving  averages  are  computed.  The 
moving  average  is  computed  as  in  Eqs  (3-13)  or  (3-14) , 
but  is  now  called  M^,  and  is  computed  as  the  average  of 
the  last  N M^'s.  These  two  averages  are  then  used  to 
determine  an  equation  for  values  of  the  dependent  variable 
a certain  number  of  periods  T=l,2,...  in  the  future.  This 
equation  is 
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(3-15) 


y.+T  = a.  + b.-T 


where 


a.  = 2M^  - 
1 11 


and 

b.  = 

r 2 

- i4z] 

l 

N-l 

1 X 

. > 

This  double  moving  average  is  quite  good  for  linear  trends 
but  will  not  give  accurate  results  when  there  is  curvature 
in  the  data.  While  there  is  a triple  moving  average  that 
addresses  curvilinear  data,  triple  exponential  smoothing, 
covered  in  the  next  section,  is  normally  used  in  that  situ- 
ation (Ref  8:90). 

Exponential  Smoothing.  Moving  average  mehtods,  when 
one  is  using  a relatively  large  N (>200) , can  require 
storage  of  a prohibitive  number  of  data  points.  Normally 
this  is  not  a problem  with  the  types  of  trending  involved 
here,  but  this  method  is,  nevertheless,  more  flexible  in 
how  it  can  react  to  new  data.  The  following  is  a develop- 
ment of  the  general  exponential  smoothing  model  (Ref  8:91). 
In  the  short  formula  if  i were  6,  for  instance,  and  N were 
5,  we  would  have 

= M&]  + — — — (3-16) 


If  we  stored  none  of  the  data,  a value  of  y^  would  not  be 
available.  Our  best  estimate  of  y^  would  then  be  M^.  By 
using  this  in  our  equation,  we  get 
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- tf  + 


y6-M^ 

5 


I rfO  + yJ. 
5 5 5 


*0  = - 
6 5 5 


For  the  general  case, 


- y.  + 
N *3- 


(3-17) 


(3-18) 


The  above  equation  could  be  used  to  calculate  Mu  if  no 

data  were  stored.  If  we  let  r~  and  and  substitute 

N ix 

these  into  the  last  equation,  we  arrive  at  the  basic  expo- 
nential model: 


SW  = r y . + (l-r)S^_1 


or 


r 


new 

data 


previous 

estimate 


(3-19) 


The  term  r,  is  called  the  smoothing  constant.  In  general, 
r should  lie  between  0.01  and  0.30,  but  the  forecaster 
should  not  hesitate  to  use  a value  outside  this  range  if  it 
gives  better  results.  It  could  be  very  profitable  to 
periodically  pick  an  arbitrary  point  in  the  past,  apply 
forecasting  techniques  and  see  how  well  they  did.  This  can 
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also  be  a tool  for  picking  or  changing  your  own  value  of  r. 
The  most  important  fact  concerning  r is  that  a larger  r 
places  more  importance  on  the  most  recent  data  (Ref  8:92). 

An  argument  similar  to  the  one  for  going  from  the 
simple  to  the  double  moving  averages  model,  Eq  (3-13)  to 
Eq  (3-15)  will  be  used  to  go  from  simple  to  double  expo- 
nential smoothing.  The  basic  exponential  smoothing  model 
is  best  when  there  is  very  little  trend.  To  describe  or 
react  to  a linear  trend  in  data,  a straight-line  model 
similar  to  Eq  (3-15)  should  be  used.  Such  is  the  case  with 
the  parameters  a^  and  b^  developed  from  the  double  exponen- 
tially smoothed  statistic  = rS$+(l-r)^_^.  That  is, 
the  following  equation: 


where 


y1+T  - + b<-T 


a.  = 2S$  - S® 
l r l 


and  b.  = JL.  f$J  - &)]  (3-20) 

1 J--r  i 1 ~J 

Again,  if  the  observations  suggest  a curvature  of  some 
sort,  one  would  assume  a quadratic  model  may  be  appro- 
priate. Again,  a new  statistic  is  needed,  the  triple 
exponentially  smoothed  statistic  S^,  where 


The  model  itself  is  of  the  form: 


yi-fT 


a.  + b.T  4 
1 1 


2 


where 


a. 

l 


3S$  - 3S$  + S$ 

i ix 


K 

“i 


r 

2 (1-r) 2 


(6-  5r  ) sW-  2 (5-4r)  S$f  (4-  3r)  S$ 

. 


c. 

1 


2 (1-  r)  2 


2$l  4-  S® 

l l 


j 


(3-22) 


Non-Parametric  Methods 

These  methods  are  more  generally  applicable  than  those 
described  in  the  first  section,  the  parametric  methods.  If 
the  data  cannot  be  assumed  to  come  from  a normal  distribu- 
tion and  sample  sizes  are  small,  these  tests  may  be  used. 
They  may  also  be  used  if  measurements  are  only  nominal  or 
ordinal.  Since  they  are  more  general,  it  is  to  be  expected 
that  they  will  not  be  quite  as  powerful  as  the  standard 
methods  when  both  are  applicable  (Ref  2:170). 

Binomial  Tests.  The  first  method  presented  is  the 
Gox-Stuart  test  for  trend.  This  test  is  based  on  a varia- 
tion of  the  sign  test,  which  in  turn  is  based  on  the 
Binomial  distribution.  This  chapter  will  not  go  into  a 
discussion  of  the  distribution,  but  assumes  the  reader 
tinders tands  that  it  would  apply  if  we  are  dealing  with  only 
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two  outcomes  with  probabilities  p and  1-p.  The  outcomes 
may  be  success-failure,  heads-tails,  or,  in  our  "sign" 
case,  plus-minus-  We  will  be  testing  whether  the  propor- 
tion of  plus  (or  minus)  signs  is  consistent  with  the 
binomial  distribution  possessing  the  assumed  or  hypothe- 
sized parameters.  Binomial  tests  are  so  versatile  and  can 
be  applied  so  simply,  they  are  sometimes  preferred  over 
more  powerful  tests  (Ref  1:95).  This  was  an  important 
consideration  when  choosing  statistical  tests  for  this 
guide . 

Cox  and  Stuart  Test  for  Trend.  The  Cox  and  Stuart 
test  for  trend  (Ref  1:130)  takes  the  sequence  of  random 
variables  y^,  y ••*,  y^,  arranged  (in  our  case)  in  the 
order  in  which  they  were  observed.  Then  these  observa- 
tions are  grouped  in  pairs  (y-^,  y^+(?) . That  is,  the 
observations  are  split  in  the  middle  and  the  first  obser- 
vation or  measurement  in  the  first  half  is  paired  with  the 
first  measurement  in  the  second  half,  the  second  paired 
with  the  second,  and  so  on.  If  there  is  an  odd  number  of 
observations,  the  middle  one  is  discarded.  The  number  of 
pairs  is  called  "n". 

The  number  of  pairs  where  the  second  entry  is  greater 
than  the  first  could  be  replaced  by  a plus  sign,  those 
where  the  second  entry  is  less,  a minus.  If  the  null 
hypothesis  is  that  there  is  no  trend,  the  number  of  plus 
signs  should  equal  the  number  of  minus  signs.  In  terms 
of  the  binomial  distribution,  this  means  the  probability 
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Fig  11,  The  Binomial  Distribution 


of  a plus  p(plus  sign)=p (minus  sign)~%.  This  hypothesis 
can  be  tested  by  asking  whether  such  a value  of  T,  the 
number  of  plus  signs,  lies  in  the  critical  region  of  the 
assumed  binomial  distribution,  i.e.,  chat  distribution 
with  p=%  and  number  of  pairs  "n".  See  Fig  11.  The  criti- 
cal region  is  that  point  where  T,  the  number  of  plus  signs, 
is  so  great  (or  small),  one  must  assume  the  sample  of  n 
pairs  did  not  come  from  a population  where  p (plus  sign)- 
p (minus  sign)=%.  That  is,  the  hypothesis  of  no  trend  must 
be  rejected.  The  size  of  the  critical  region,  a,  or  the 
significance  level,  has  basically  the  same  interpretation 
as  it  did  with  the  normal  distribution  in  Chapter  II. 

When  actually  applying  the  Cox  and  Stuart  test,  it  is 
sufficient  to  count  the  number  of  pairs  and  not  convert  to 


plus  and  minus  signs  as  such.  When  counting  the  pairs 
where  the  second  entry  is  greater  than  the  first  to  get 
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the  value  T,  one  thing  must  be  remembered.  Ties,  or  equal 
entries,  should  not  contribute  to  acceptance  or  rejection 
of  Hq,  so  should  not  be  counted. 

In  order  to  save  space,  tables  usually  stop  at  n=20, 
since  the  normal  distribution  is  a fair  approximation  for 
n>20 . 

If  there  are  cycles  in  the  data  and  one  knows  which 
pairs  of  measurements  appear  at  the  same  point  in  the 
cycle,  then  the  overall  trend  can  still  be  evaluated  with 
the  cyclical  influence  neutralized.  For  instance,  if 
there  were  a yearly  cycle,  pairing  the  same  months  to  com- 
pute the  T-statistic  would  be  done  to  test  the  presence  of 
an  underlying  trend. 

Efficiency,  or  power  of  the  test  for  a given  sample 
size,  can  be  improved  by  discarding  middle  observations. 

For  instance,  the  first  ten  observations  could  be  paired 
with  the  last  ten,  or  the  first  third  paired  with  the  last 
third.  If  loss  of  data  is  to  be  avoided,  another  variation 
is  suggested  (Ref  1:135).  Pairing  from  the  ends  of  the 
sequence  (ypYn)>  (y2>yn_i)»  etc.,  also  would  exaggerate 
trend  but  still  employ  all  data  points.  Usually  loss  of 
data  does  result  in  loss  of  power  (Ref  1:133). 

The  Cox  and  Stuart  test  may  also  be  used  for  correla- 
tion between  two  different  measures . The  paired  measures 
are  listed  by  the  rank  (in  size)  of  either  the  first  member 
of  each  pair  or  the  second.  If  one  member  has  less  ties 
than  the  other,  that  member  of  each  pair  is  preferred  for 


40 


the  ranking.  The  test  for  trend,  then,  is  applied  to  the 
remaining  member  of  the  pair.  An  upward  trend  would  indi- 
cate positive  correlation,  a do~mward  trend  would  indicate 
negative  correlation,  and  no  trend  would  indicate  zero 
correlation. 

Kendall's  Test  for  Trend.  The  next  two  methods, 
Kendall’s  and  Spearman's  tests  for  trend,  use  the  concept 
of  ranking.  Ranking  assumes  that  individuals  (or  air- 
planes, radar  sets)  can  be  arranged  in  order  according  to 
the  magnitude  of  some  measurable  quantity.  Customarily  we 
denote  the  ranks  by  1,  2,  ••*,  n from  smallest  to  largest. 

Before  defining  Kendall's  statistic,  let  us  define 
concordant  and  discordant  pairs.  Two  pairs  are  concordant 
if  both  members  of  one  pair  are  larger  than  the  respective 
member  of  the  other  pair.  For  example,  the  pairs  (1,3) 
end  (6,9)  are  concordant  since  both  6>1  and  9>3.  Similarly, 
two  pairs  are  discordant  if  one  member  is  smaller  and  one 
is  larger  than  the  corresponding  member  of  the  other  pair. 
For  example,  the  pairs  (1,6)  and  (9,3)  are  discordant. 

If  we  let  Nc  be  the  number  of  concordant  pairs  in  a 
set  of  pairs,  and  the  number  of  discordant  pairs, 
Kendall's  statistic  t is  defined  as  the  ratio: 


x 


VNd 

%n(n-l) 


(3-23) 


where  %n(n-l)  is  the  total  number  of  pairs  possible,  often 
denoted  by  f 2 J ==^xl (n~  1 ) °r  n things  taken  2 at  a time. 
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A "r"  of  -1  means  perfect  reverse  relationship , T =1  means 
perfect  positive  relationship,  andT=0  means  no  relation- 
ship. Instead  of  computing  -t  itself,  however,  if  Quan- 
tiles or  critical  values  of  T=Nc-K^  are  available,  these 
may  be  used. 

In  practice,  when  this  method  is  used  for  trending 
rather  than  correlation,  counting  concordant  and  discord- 
ant pairs  is  much  easier  than  implied  above.  Since  a 
time-sequence  of  measurements  are  in  a ranking  order 
already,  such  as  time-period  1,  2,  3,  *•♦,  n,  only  the 
corresponding  measurements  (score  for  time-period  1,  2, 
etc.)  need  be  considered.  The  steps  to  find  T,  T-f^-N^, 
are  covered  in  detail  in  the  guide.  Significance  testing 
is  accomplished  by  comparing  T to  the  critical  value  at 
significance  level  a,  as  given  in  tables. 

Spearman's  Test  for  Trend.  Spearman's  test  is  quite 
similar  to  Kendall's,  except  that  it  uses  the  differences 
R(x^)-R(y^)  between  the  ranks,  squares  them,  and  takes 
their  atom.  Then  the  statistic  is  defined  as  follows: 


P = 1 - 


6E (R(xi)-R(yi)) 2 
n3-n 


(3-24) 


The  quantity  R(x^) , again,  denotes  the  rank  of  the  time- 
period  number  (1st  period,  2nd  period,  •••,  nth  period) 
and  R(y^)  is  the  rank  of  the  measurements.  Instead  of 
computing  Eq  (3-24),  the  sum  of  squares,  S,  is  defined  as: 
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S = l (R(xi)-R(yi)) 


(3-25) 


and  critical  values  of  this  test-statistic  S are  well 
tabulated  and  widely  available.  Significance- testing  is 
accomplished  by  comparing  the  computed  S with  table  values 
to  determine  if  it  lies  in  the  critical  region. 
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IV.  The  User's  Guide 


Introduction 

This  guide  is  written  for  these  persons  who  are  put 
in  the  job  of  Trend  Analysis  (TA)  but  have  a limited 
statistics  or  mathematical  background.  It  is  written  v/ith 
an  aircrew  member  in  mind,  one  who  is  in  a staff  position, 
but  is  asked  to  do  TA.  Even  those  who  have  such  back- 
ground may  not  recognise  certain  problems  as  lending  them- 
selves to  statistical  solutions.  Also,  such  persons  may 
not  have  used  the  methods  for  so  long  they  are  no  longer 
familiar  (Ref  3:1).  This  guide  should  provide  the  user 
with  a tool  for  easily  determining  the  form  and  signifi- 
cance of  trends.  Then  the  user's  expertise  in  his  primary 
job  can  ^e  applied  more  readily  to  the  problem  of  explain- 
ing or  reversing  the  trend. 

Some  may  think  of  TA  as  strictly  identifying  weak 
areas  in  his  rsalm  of  responsibility.  This  is  fine,  but 
the  writer  feels  that  if  one  is  collecting  evaluation 


scores  (for  instance)  over  time  anyway,  and  has  methods 
available  for  measuring  trend,  he  may  find  doing  TA  on  his 
raw  data  yields  very  valuable  information.  Others  may 
have  the  data  dumped  in  their  lap,  so  to  speak,  and  told 
to  analyze  it  for  significant  trends.  Or  possibly  some 
are  told  to  set  up  a program  of  data  collection  that  will 
permit  identification  of  trends.  This  guide  is  written 
primarily  for  the  one  who  has  data  dumped  on  him,  but  all 
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may  find  it  useful.  That  1 famil-f at-i^ation  with  the 
methods  should  give  any  usei  o feel  for  what  collection 
and  classification  methods  netl  be  used. 

To  that  first  person  the  writer  says  he  might  also  be 
surprised  how  often  people  apply  "trend  analysis”  uncon- 
sciously. Whenever  a briefing  is  giv^n  with  comparisons 
made  by  graphs  and  charts  the  eye  does  some  "pattern 
recognition"  and  the  viewer  tends  to  extrapolate  or  project 
those  patterns  into  the  future . So , why  not  quantify  such 
things  ? 

This  guide  is  intended  to  be  relatively  painless.  It 
is  not  assumed  that  the  user  has  read  the  first  chapters, 
but  that  he  will  at  some  time  go  back  and  skim  over  them 
at  least.  They  cover  some  of  the  theoretical  background 
used  in  the  development  of  the  guide.  The  writer  asks 
forgiveness  if  it  seems  he  is  insulting  anyone's  intelli- 
gence by  belaboring  simple  things.  It  is  not  his  intention, 
he  just  wishes  to  cover  everything  in  sufficient  detail. 

A Word  on  Samples 

Basic  to  the  methods  here  is  the  idea  of  a sample 
mean.  It  is  nothing  more  than  an  average  (computation  is 
covered  in  Guidesheet  # 1)  and  it  is  assumed  everyone  has 
an  idea  of  what  an  average  is.  A sample  mean  is  an  average, 
but  the  word  "sample"  implies  the  presence  of  a larger 
"whole".  The  basic  idea  may  be  clear  from  an  example.  Let 
us  say  we  take  a sample  of  a group  of  radar-navigators' 
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bombing  CEPs.  If  the  sample  average  or  mean  is  small  (say 
100  ft) , this  means  the  group  is  a part  of  a "skillful" 
whole,  or  population,  a population  that  always  has  small 
CEPs.  If  the  group's  CEPs  were  large,  there  would  be  some 
question  as  to  whether  they  are  members  of  this  hypothet- 
ical skillful  population.  This  whole  idea  is  called 
statistical  inference,  i.e.,  can  we  "infer"  this  whole 
population  is  skillful?  Chapter  II  of  this  thesis  covers 
this  in  more  detail. 

Usage  of  the  term  "population"  in  statistics  is  a 
carry-over  from  the  days  when  statistics  was  applied  mainly 
to  sociological  and  economic  phenomena.  Nowadays  it  is 
applied  to  any  set  or  collection  of  objects,  actual  or  con- 
ceptual, and  mainly  to  sets  of  numbers,  measurements,  or 
observations  (Ref  4:160). 


Statistical  Inference 

Whenever  we  talk  about  inference,  we  need  some  measure 
of  how  confident  we  are  about  our  statistical  results. 
Statistical  literature  talks  about  confidence  level,  level 
of  significance,  confidence  intervals,  and  not  always  with 
consistency.  Chapter  II  of  this  guide  talks  about  the 
driving  concept  behind  all  of  these,  the  concept  of  the 
size  of  the  critical  region  a,  or  the  probability  of  a 
Type  I error.  In  short,  it  is  nothing  more  than  a measure 
of  how  confident  one  is  of  a relatively  unsure  thing.  We 
are  dealing  with  probabilities,  not  certainties,  so  we  are 
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never  sure  of  our  answers.  For  this  guide's  purposes  we 
will  consider  a,  the  level  of  significance,  as  the  proba- 
bility of  drawing  a wrong  conclusion  from  our  data. 
Specifically,  when  we  deal  with  hypotheses,  it  will  be  the 
probability  of  rejecting  a true  null  hypothesis  (see 
Chapter  II).  Normally  a is  chosen  to  be  a=.05,  somewhat 
less  often  chosen  to  be  a=.10  or  a=.01.  In  these  cases 
our  confidence  level  would  be  the  complement  of  a or 
l-.05=.95  and  1-. 10=90.  These  could  be  thought  of  as 
being  95%  and  90%  confident  of  our  conclusions  or  answers. 
Put  in  gambling  terms  a confidence  level  (again  the  comple- 
ment of  significance  level  a)  of  95%  would  mean  nineteen- 
to-one  odds  that  you  have  made  the  right  decision  or  that 
your  sample  value  is  close  enough  to  the  true  value  to  use. 
Likewise,  a confidence  level  of  90%  (ct=.10)  is  odds  of  nine- 
to-one . 

One  may  say  here,  "Why  not  be  99.9%  or  even  100%  con- 
fident all  the  time?"  The  reason  is  you  are  dealing  with 
a sample  and  can  never  be  absolutely  certain  about  the 
population  unless  your  sample  equals  the  population.  This 
is  sometimes  possible,  but  usually  oxitrageously  expensive. 
This  can  be  put  another  way  by  bringing  up  another  type  of 
error.  This  was  done  in  the  introductory  Chapter  II,  page 
8 , and  it  will  be  sufficient  to  say  (I  hope)  that  one 
increases  the  chances  of  this  second  type  error  as  he 
chooses  lower  values  of  a for  a given  sample  size. 
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Before  Picking  a Method 

Some  things  had  to  be  assumed  when  this  guide  was 
compiled.  First  and  foremost,  implied  in  every  Trend 
Analysis  method  (all  methods  except  those  dealing  with 
correlation)  is  that  some  sort  of  consistent  measurement 
is  taken  over  time.  It  may  be  once  a minute,  once  a 
flight,  once  every  2.73  hours  or  whatever.  The  assumption 
in  this  guide  is  that  one  is  measuring  the  same  thing  each 
time-period. 

For  all  methods  it  is  assumed  that  a hand-hold  elec- 
tronic calculator  is  available.  The  feature  of  floating- 
decimal  is  required  to  get  any  degree  of  accuracy.  A 
simple  four- function  calculator  of  this  type  is  barely 
sufficient  and  costs  less  than  five  dollars.  The  next 
desired  features  would  be  a square-root  function  and  then 
a power  function,  either  x2  or  yx  or  both.  Memory  feature 
would  also  be  a handy  asset. 

Finally,  it  is  assumed  that  one  will  plot  his  data 
over  time,  then  choose  the  method  for  trending.  The 
method  of  trend- description  chosen  should  fit  the  pattern 
suggested  by  the  time  plot.  See  Guidesheet  #2  for  plot- 
ting instructions  and  example. 

The  following  are  notes  and  cautions  that  need  to  be 
noted  for  each  method.  Although  many  of  these  deal  with 
the  time-plot,  they  need  to  be  mentioned  here  for  emphasis. 
Furthermore,  due  to  the  final  assumption  above,  one  must 
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think  of  these  in  conjunction  with  the  application  of  all 
the  methods  of  trending.  Notes: 

Q At  times  there  will  be  more  past  data  than  is  neces- 

sary, or  in  your  judgement  should  not  be  entered  into 
the  test  (rules  or  method  of  measurement  changed,  for 
instance).  If  this  is  the  case,  re-number  the  data 
points  so  that  time-period  #1  (x^=l)  is  the  first 
data  point  you  feel  should  enter  into  the  computa- 
tions . This  was  done  in  the  example  for  Curvilinear 
Regression,  Guidesheet  #6,  Step  3. 

(?) Do  not  discard  points,  unless  they  are  mis -measured 

or  otherwise  invalid.  If  one  is  discarded  for  being 
invalid,  or  a measurement  is  missing  for  that  time- 
period,  re-number  the  table  and  time-plot  leaving 
the  discarded  point  out  completely.  See  Guidesheet 
#2,  Step  8. 

3) As  soon  as  a time-plot  is  made,  connect  plotted 

points  with  straight  line-segments . This  will  give 
visual  impact,  making  it  easier  to  see  general  shape, 
cycles,  etc.,  in  the  data.  When  they  have  served 
their  purpose,  however,  erase  them  or  they  will  inter- 
fere with  the  computed  trend  line. 


®- 


@- 


! Usually  when  a new  measurement  is  made  and  recorded 

on  table  and  time-plot,  the  trend-line  must  be  re- 
computed. If  the  new  point  lies  on  or  very  close  to 
the  forecast  line,  however,  this  is  not  necessary. 
Also,  the  method  used  last  may  not  be  the  best  this 
time,  so  reevaluate  each  new  time-period. 

-If  the  data  suggests  a curve- type  trend,  and  there  is 
no  time  to  do  extensive  computations,  a "french 
curve"  may  be  accurate  enough,  and  control  chart 
evaluations  are  still  valid. 
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Cautions 

(T) — Do  not  project  trend-lines  too  far  into  the  future.  They 
tend  to  become  very  inaccurate  after  3 or  so  periods  into 
the  future,  especially  the  trends  described  by  curved  lines 

(2^ Be  careful  to  keep  as  many  decimal  places  as  possible 

in  the  computations,  because  subtraction  accumulates 
round-off  errors.  Final  answers  may  be  rounded  off. 

(IT) Be  extremely  careful  when  using  the  Control-Chart  method 

(Guidesheets  #3  or  #4)  for  evaluating  trends  in  average 
test  scores.  If  the  test  scores  are  percentages,  they 
may  be  quite  high,  especially  when  one  is  talking  about 
Stan/Eval  test  scores.  Contrary  to  popular  belief, 
such  scores  may  not  be  normally  distributed.  This  may  be 
because  no  percentage  scores  can  possibly  be  above  1007, 
Such  distribution  may  be  highly  skewed  (unsymmetrical) 
even  when  the  standard  deviation  is  small,  say  2.  Then, 
with  such  a u,  a trend  may  be  considered  significant  when 
scores  deviate  very  little  from  the  standard.  In  this 
case,  methods  3 and  4 are  invalid.  Guidesheets  11,  12, 
and  13  would  be  valid,  however. 

(4) No  statistical  test  should  be  blindly  followed.  If  a 

test  given  in  this  guide  gives  illogical  or  inconsistent 
results,  try  one  of  the  non-parametric  methods  (11,  12 
or  13) . The  problem  is  probably  with  the  form  of  the 
assumed  distribution,  such  as  noted  in  caution  3 above. 

(jT) Be  careful  of  momentary  deviations  (a  single  measure- 

ment deviating  very  much  from  a standard.  Normally  one 
should  never  decide  there  is  a trend  based  on  one  data 
point.  Momentary  deviations  cannot  be  handled  (ex- 
plained) by  statistical  methods. 

(T) When  testing  for  trends  in  a single  measurement  per  time- 

period,  use  Guidesheets  11,  12,  and  13.  Control  Charts 
may  only  be  used  for  trends  in  averages . 
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GUIDESHEET  #1.  The  Mean  and  Standard  Deviation 


Steps : 


-Count  the  total  number  of  observations  or  measure- 
ments which  are  to  be  averaged.  Call  this  total 
number  "n". 


-Arrange  measurements  in  vertical  columns  (as  in 
example)  with  column  headings  as  follows.  Column  3 
will  be  filled  in  later. 


Name  or 
Observation 
Number 
i 


t 


Measurement 

y-f 


Square 


1 


-f 


sv- 


n 


(T) — compute  I yi=yi+y2+y3+*  * *+yn>  t^ie  sum  a n the 

measurements.  The  space  directly  under  column  2 may 


be  used  for  Zy^. 

-Compute  y,  the  Mean  of  the  n measurements,  by  divid- 
ing the  sum  Zy^  by  the  total  number  of  observations 


n. 


Mean 


©~ 

©- 

©- 


-Square  each  measurement,  y|=y^*y£  and  record  in 
column  3. 

-Compute  the  sum  of  squares  of  the  observations 
ZyJa,yi+y2+y3+’  ‘ '+yn‘  sPace  directly  below  column 
3 may  be  used  to  record  the  value  of  Zy?. 

-Compute  the  variance  s2  according  to  the  following 
formula : 


51 


g2  _ 

n(n-l) 

--Compute  the  Standard  Deviation  s=/s2,  that  is,  take 
the  square  root  of  s2  in  order  to  obtain  s. 
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EXAMPLE  for  Mean  and  Standard  Deviation  (Guidesheec  #1) 


Steps : 

@ Ten  aircraft  have  flown  this  week  against  a certain 

target.  It  is  desired  to  find  what  the  average  bomb 
score  is.  n=10. 


©- 


Name  or 
Observation 
Number 


Measurement 

(bomb  score  in  Square 


yi 

yi 

Acft  # 1 

0.5 

.25 

" # 2 

21.3 

453.69 

" # 3 

6.9 

47.61 

•'  # 4 

11.0 

121.00 

" # 5 

5.7 

32.49 

" # 6 

7.2 

51.84 

" # 7 

10.5 

110.25 

” # 8 

11.8 

139.24 

" # 9 

3.5 

12.25 

" #10 

13.6 

184.96 

n 10 

(Jy — .I  _ 1 yi=:yj+y2'}-y3'f • • *+y3Q=92 • Q recorded  as  show 
above. 

®Zy.  q o n 

The  mean,  or  average:  y= — — = = 9.20  hundreds 

„ c cu  non  c*-  n 

X A.  U • W A J t-\J  JLU  . 

( J T) Squaring  each  measurement: 

y£  = y1*y1  = (.5) *(.5)  = .25  recorded  as  shown 


Ey .=92. 0 

j x 


Ey?=1153 . 58 


0 recorded  as  show 


y|  - y2-y2  - (21. 3). (21. 3)  - 453.69  " " 

• • • 

• • • 

• • ♦ 

y10=y10*y10=  (13.6)-(13.6)  = 184.96  " " 

(IT) The  sum  of  the  squares:  Iy|=y|+y2+* * *+y £q= 

. 25+453. 69+*  • ‘+184. 96  = 1153.58  recorded  as  show. 


(7)— The  variance  s 2 : 

. n (2y?)  - (^y-j.) 


n(n-l) 


10(1153. 58)-(92,0)2 
10(10-1) 


11535. 8-(92.0)»(92.0)  = 11535.8-8464 
10(9)  90 


3071.8 

90 


= 34.13 


©- 


The  Standard  Deviation  s : 


s = /s7  = /34.13  = 5.84  hundreds  of  ft.  or  584  ft. 
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GUIDESHEET  #2.  The  Time-Plot 


Steps : 
©- 

©- 


Obtain  some  graph-paper.  It  is  much  too  tedious  to 
draw  a useful  time-line  on  plain  paper. 

The  graph-paper  may  be  used  vertically  or  horizon- 
tally, whichever  allows  the  necessary  number  of 
divisions  to  be  used  for  each  axis.  Consider  the 
inconvenience  to  the  reader  if  he  has  to  turn  a 
bound  paper  or  report  sideways  to  read  the  graph. 

(T) — For  a time-plot,  the  vertical  axis  is  used  for  meas- 
urements and  the  horizontal  axis  for  time-period 
numbers.  If  measurements  cover  a short  range  of  high 
values,  you  may  wish  to  start  the  scale  on  the  ver- 
tical axis  just  below  the  lowest  possible  measurement 
value  and  end  just  above  the  highest.  If  this  is 
done,  draw  in  a discontinuity  just  above  where  the 
two  axes  cross  (the  "origin"  or  zero  point  for  each 
axis)  to  indicate  this  was  done. 

(V) Label  the  vertical  axis  the  y-axis.  Also  write  in 

"scores"  or  "measurements"  or  "CEPs"  in  parentheses, 
whatever  is  appropriate.  Label  every  bold  horizontal 
line  with  an  even  number  in  your  range  of  values. 

(jf) Label  the  horizontal  axis  the  x-axis.  This  axis  is 

for  the  time-period  numbers . Time-period  numbers 
(1,  2,  ••*,  n)  are  used  rather  than  the  time-period 
names  to  aid  in  computation  and  to  avoid  clutter  and 
confusion.  For  instance,  tests  may  not  be  given,  or 
flights  flown,  on  the  same  date  each  time-period,  so 
trying  to  record  day  and  month  would  get  messy.  Your 
table  has  the  correct  information,  presumably. 

(T) Leave  plenty  of  time-periods  in  the  future  for  trend- 

ing and  forecasting. 
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-Plot  the  first  measurement.  That  is,  for  x=i,  plot 
the  corresponding  y-value.  For  trending,  there 
should  only  be  one  measurement  plotted  for  each  time- 
period.  If  there  are  more  than  one  for  any  single 
time-period,  the  mean  or  average  is  the  appropriate 
value  to  plot.  See  Guidesheet  #1. 

-If  there  is  a missing  measurement,  do  not  count  that 
time-period  at  all,  that  is,  no  time-period  number 
is  assigned  to  that  month,  for  instance. 

-Plot  the  remaining  values . 

-Connect  each  point  with  short  line- segments . This 
will  give  you  an  idea  about  what  sort  of  trend  there 
is,  thus  which  trending  method  would  be  most  appro- 
priate. Do  this  in  pencil,  because  it  should  be 
erased  before  superimposing  the  computed  trend  line. 


-If  text  is  to  be  added  and  it  must  accompany  the 
graph,  graph-paper  is  reproducible,  so  xerox  it,  cut 
it  out,  and  place  it  above  your  text.  If  necessary, 
reproduce  the  combined  page  to  avoid  glueing  mess. 


y-axis(Test  Scores, %) 


EXAMPLE  for  Time-Plot  (Guidesheet  #2) 


dr 


7 9 11  13  15  17  19  21  23  25  27 

x-axis  (Time-Period  Numbers 


Steps  (T)  and  (T)  are  self  explanatory.  The  remaining 

step-numbers  are  placed  at  the  appropriate  plac 


Time-Period  Time-Period 
Name  Number 


22  Sept 

16  Oct 

17  Nov 
8 Dec 


Measurement 
(Test  Scores,  %) 

*i 


97.6 

nn  a 

y J . 0 

91.0 

99.0 
99.5 

98.0 

97.0 

• Mi s s ingrNo— ' Tes fc— 

97.0 

98.0 

96.0 

95.0 


GUIDESHEET  #3.  The  Control  Chart  for  Small  Samples 


Assumptions : 

@ — The  measurements  are  a sample  from  a normal  popiila- 
tion.  That  is,  if  there  were  many  of  these  meas- 
urements, they  would  be  distributed  according  to  the 
well-known  "bell- curve".  Usually  measurements  such 
as  test  scores,  bomb  scores,  etc.,  can  be  treated  as 
normally  distributed.  Another  way  of  checking  this 
is  to  ask  if  there  were  many  measurements  of  this 
type,  would  most  be  near  the  average  and  few  far  from 
it,  both  above  and  below.  If  the  answer  is  yes,  this 
assumption  is  probably  a good  one.  If  this  assump- 
tion  cannot  be  met,  go  to  Gui desheet  #4. 


-The  sample  drawn  is  a random  sample,  not  biased. 


Steps : 


Assume  a value  for  the  population  mean  p=pQ.  This 
may  be  the  actual  population  mean  (rarely  known  for 
sure)  or  a hypothesized  mean  such  as  a standard  one 
wishes  to  meet  or  continue  to  meet. 


-Draw  a chart  for  your  measurements  (y-axis)  over  time 
(x-axis).  Draw  a horizontal  line  at  y=P0- 


-Count  the  number  of  measurements  that  make  up  the 
sample  each  time-period.  This  is  the  sample  size 


-Compute  the  sample  standard  deviation  's"  (see  Guide- 
sheet  #1).  This  s should  be  nearly  constant  across 
all  time-periods.  If  it  is  not,  see  Note  #3. 


(T) Choose  a level  of  significance  a. 

(IT) Look  up  the  table  value  of  the  t-statistic  (Table  I 

on  page  62)  corresponding  to  the  chosen  a and 
degrees- of-freedom  v=n-l.  Call  this  value  t(Table)=t 
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-Compute  the  Lower  Control  Limit  (LCL)  of  the  Control 
Chart  according  to  the  following  formula: 


LCL  - y - t (Table)— ~ 
0 /n 


(1) 


-Draw  a horizontal  line  at  y=LCL  on  the  Control  Chart. 
A dotted  line  or  different  color  line  may  be  used  to 
distinguish  it  from  the  line  y=viQ. 


9 J If  any  single  sample  mean  ever  falls  below  the  LCL, 

or  means  begin  to  approach  it  (a  trend) , then  this 
indicates  that,  with  probability  (1-a) , you  are  no 
longer  meeting  your  standard,  based  on  the  samples 
taken.  That  is,  a trend  line,  described  by  one  of 
the  methods  in  this  guidebook,  is  significant  at  the 
a- level  if  it  touches  the  LCL. 


Notes : 

@ If  the  UCL  (Upper  Control  Limit)  is  needed  instead, 


UCL-u  +t  (Table) 

° / n 


All  else  is  the  same. 


(T) The  smaller  one  chooses  a to  be,  the  further  from  the 

standard  the  control  limit  line  (UCL  or  LCL)  will  be. 
A very  small  a,  however,  may  limit  your  ability  to 
detect  trends. 

•If  the  standard  deviation  s changes  significantly 
over  the  time-periods,  the  control  limit  must  be 
recomputed  and  replotted  each  time-period.  This 
would  make  a segmented  limit.  It  does  not  invalidate 
the  method  in  any  way  (Ref  4:426).  If  this  is  not 
desirable,  a non-parametric  method  (Guidesheet  #11, 
12,  or  13)  must  be  used. 
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EXAMPLE  for  Control  Chart  for  Small  Samples  (Guidesheet  #3) 


Steps : 


-The  Wing  Commander  is  interested  in  how  well  his 
people  will  measure  up  to  a standard  of  90%  correct 
responses  on  Stan/Eval  tests  in  the  next  six  months. 
He  asks  his  trend  analysis  shop  to  monitor  Stan/Eval 
test  scores  for  trends.  Our  Wing  Commander  is 
assuming  uo=90%. 


©~ 


-The  Wing  is  large  and  only  a few  personnel  are  tested 
each  month,  actually  six.  The  TA  officer  knows  it 
will  be  helpful  to  look  at  past  records  at  least  as 
a start  and  to  see  if  the  90?o  standard  is  a reason- 
able one.  He  draws  o.  chart  covering  all  past  data 
plus  plenty  of  room  for  forecasting  or  recording  new 
data.  The  data  and  chart  are  shown  here: 


Month  #1  #2  #3  #4  #5  #6  #7  #8  #9  #10  #11  #12 

Score  91.5  91  89  88  92.1  90  90  89  89  88  88  87.-5 


3 j — Records  showed  that  about  the  same  number  of  persons 
are  tested  each  month  and  the  scores  don't  seem  to 
vary  a great  deal,  unless  there  is  a big  turnover  in 
people.  That  is,  n=6  every  month. 


— The  computed  sample  standard  deviation  (Guidesheet 
#1)  turned  out  to  be  s=3.  The  quantity  s was  nearly 
constant  over  time. 


60 


-The  TA  officer  wanted  to  be  reasonably  sure  he  did 
not  report  a trend  when  there  was  actually  none 
(Type  I error) , his  heart  being  with  the  crew  mem- 
bers, so  he  picked  a=.01. 

-Looking  in  Table  I,  he  found  t (Table)  to  be  3.365. 
For  a=.01,  and  v=n-l=5,  t(Table)=3. 365.  See  portion 
of  Table  below. 


V 

£*■0.10 

a -0.05 

a- 0025 

a- 0.01 

a® 

l 

3.078 

6.314 

12.706 

01.82) 

63 

2 

1.886 

2.920 

4.303 

6.965 

9 

% 

1.638 

2.353 

3.182 

4.541 

5 

4 

5 

1.533 

1.476 

2.132 

2 015 

2.776 

2,571 

3.474 

&) 

4 

4 

6 

1.440 

1.943 

2.447 

3.143 

3 

7 

1.415 

1.895 

2.365 

2.998 

3 

8 

1 397 

1 K60 

7 304 

■>Mt, 

3 

— The  lower  control  limit  was  computed  as  follows : 


LCL  = y -t  (Table) 

° /n 


= 90-(3. 365) -(1.225)  = 90-4.12=  85.88 
LCL  = 85,9 

■-The  control  chart  was  drawn  as  shown. 


94 

*8 

ra 

90 

a) 

n 

o 

86 

o 

CO 

82 

•H 

i 

Standard 


© © 


©~  -0 ©- 


LCL  (a=.01 


1 2 3 4 5 6 7 8 9 10  11  12  13  14 

(Time-Period) 

--If  a were  not  so  small,  say  a=.10,  LCL  would  be  88.2 
One  is  less  sure,  however,  when  he  says  there  is  a 
trend.  He  has  an  a=.10  probability  of  saying  there 
is  a trend  when  there  is  none  as  opposed  to  a=.01 
(much  less  probability) . 


Table  I 

The  Student-T  Distribution 


Values  of  f„* 


V 

a -0.10  * 

Or*  0.05 

a»  0.025 

Or*  0.01 

a - 0.005 

V 

1 

3 078 

6.314 

12.706 

31.821 

63.657 

1 

2 

1.886 

2.920 

4.303 

6.96S 

9.925 

2 

3 

1.638 

2.353 

3.182 

4341 

5.841 

3 

4 

1.533 

2.132 

2.776 

3.474 

4.604 

4 

S 

1.476 

2.015 

2371 

3.365 

4.032 

5 

6 

1.440 

1.943 

2.447 

3.143 

3.707 

6 

7 ' 

1.415 

1.895 

2.365 

2.998 

3.499 

7 

8 

1.397 

1.860 

2 306 

2.896 

3.355 

8 

9 

1.383 

1.833 

2.262 

2.821 

9 

10 

1.372 

1.812 

2.228 

2.764  . 

3.169 

10 

11 

1.363 

1.796 

2.201 

2.718 

3.106 

11 

12 

1.356 

1.782 

2.179 

2.681 

3 055 

12 

13 

1.350 

1.771 

13 

14 

1.345 

1.761 

2.145 

2.624 

2.977 

14 

15 

1.341 

1.753 

2.131 

2.602 

. 2.947 

15 

16 

1.337 

1.746 

2.120 

'2383 

2.921 

16 

17 

1.333 

1.740 

2.110 

2.567 

2.898 

17 

18 

1.330 

1.734 

2.101 

2.5S2 

2.878 

18 

19 

1.328 

1.729 

2.093 

2.539 

2.861 

19 

20 

1.325- 

1.725 

2 086 

2.528 

2 84S 

20 

21 

1.323 

1.721 

2.080 

2.518 

2.831 

21 

22 

1.321 

1.717 

2.074 

2.508 

2.819 

22 

23 

1.319 

1.714 

2.069 

2.500 

2.807 

23 

24 

1318 

1.711 

2.064 

2.492 

2.797 

24 

25 

1.316 

1.708 

2.060 

2.485 

2.787 

25 

26 

1315 

1.706 

2.056 

2.47) 

2.779 

26 

27 

1314 

1.703 

2.052 

2.473 

2.771 

27 

28 

1.313 

1.701 

2.048 

2.467 

2.763 

28 

29 

1.311 

1.699 

2.045 

2.462 

2.756 

29 

inf 

1.282 

1.645 

1.960 

2.326 

2.576 

inf. 

62 


GUIDESHEET  #4.  The  Control  Chart  for  Large  Samples 


Assumptions : 

© Guidesheet  #3  cannot  be  used  because  the  assumption 

of  normality  is  not  a reasonable  assumption.  How- 
ever, to  use  this  method,  "n"  (number  of  measurements 
in  a sample)  must  be  large,  n>30.  If  n-30  and  we 
cannot  asstime  normality,  then  non-parametric  methods 
must  be  used  to  determine  significance  of  trends. 

See  Guidesheet  #11,  12,  or  13. 


-The  sample  drawn  is  a random  sample,  not  biased. 


Steps : 


-Assume  a value  for  the  population  mean  P=y0*  This 
may  be  the  actual  population  mean  (rarely  known  for 
sure)  or  a hypothesized  mean  such  as  a standard  one 
wishes  to  meet  or  continue  to  meet. 


©~ 


-Draw  a chart  for  your  measurements  (y-axis)  over  time 
(x-axis).  Draw  a horizontal  line  at  y=yQ. 

-Count  the  number  of  measurements  that  make  up  the 
sample  each  time-period.  This  is  the  sample  size 


Compute  the  sample  standard  deviation  "s"  (see  Guide- 
sheet  #1) . This  s should  be  nearly  constant  across 
all  time-periods.  If  it  is  not,  see  Note  #3. 


( 5 Choose  a level  of  significance  a. 

) — Look  up  the  table  value  of  the  z-statistic  in  Note  #4 

corresponding  to  the  chosen  value  of  a.  Call  this 

value  z(Table)=z  . 

a 

(T) Compute  the  Lower  Control  Limit  (LCL)  of  the  Control 

Chart  according  to  the  following  formula: 
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LCL  = y -z  (Table)  • — 
° /n 


(1) 


{8j Draw  a horizontal  line  at  y=LCL  on  the  Control  Chart. 

A dotted  line  or  different  color  line  may  be  used  to 
distinguish  it  from  the  line  y=yQ. 

^9^ — If  any  single  sample  mean  ever  falls  below  the  LCL, 
or  means  begin  to  approach  it  (a  trend) , then  this 
indicates  that,  with  probability  (1-a) , you  are  no 
longer  meeting  your  standard,  based  on  the  samples 
taken.  That  is,  a trend  line,  described  by  one  of 
the  methods  in  this  guidebook,  is  significant  at  the 
a- level  if  it  touches  the  LCL. 


Notes : 


-If  the  UCL  (Upper  Control  Limit)  is  needed  instead 

UCL=y+z (Table) ‘-rlr:  . All  else  is  the  same, 
o /n 


f 


The  smaller  one  chooses  a to  be,  the  further  from  the 
standard  the  control  limit  line  (UCL  or  LCL)  will  be. 
A very  small  a,  however,  may  limit  your  ability  to 
detect  trends. 


— If  the  standard  deviation  s changes  significantly 
over  the  time-periods,  the  control  limit  must  be 
recomputed  and  replotted  each  time-period.  This 
would  make  a segmented  limit.  It  does  not  invalidate 
the  method  in  any  way  (Ref  4:426).  If  this  is  not 
desirable,  a non-parametric  method  (Guidesheet  #11, 
12,  or  13)  must  be  used. 


-Values  of  z-statistic  for  various  values  of  a are 
given  here 


a = 0.10  0.05  0.025  0.01 
z = 1.28  1.645  1.96  2.33 


0.005 

2.575 
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EXAMPLE  for  Control  Chart  for  Large  Samples  (Guidesheet  #4) 


Steps: 


-The  trend  analysis  officer  at  Headquarters  keeps 
track  of  how  well  his  command  does  on  Stan/Eval  tests 
each  period.  Since  he  requires  nearly  the  same  num- 
ber of  tests  to  be  reported  each  period,  and  this 
number  is  greater  than  30,  he  chooses  this  method 
(large- sample)  to  build  a control  chart.  He  is 
interested  in  maintaining  a standard  of  90%  correct 
responses  on  tests  so  his  hypothesized  mean  is 
yo=90%. 

-The  TA  officer  has  past  data,  so  he  plots  it  over 
time  for  informational  purposes  as  shown  and  draws 
in  y=yQ: 


Time-Period: 

#1 

#2 

#3 

#4  #5 

#6 

#7 

#8 

Average  Score: 

91.5 

90.8 

90.2 

91  89.8 

89.7 

90.5 

90 

#9 

#10 

#11 

#12 

89.8 

89 

91 

89.2 

-The  number  of  tost  scores  turnod  in  each  time-period 
is  around  31,  so  he  uses  n=31. 
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(V) The  sample  standard  deviation  s is  computed  accord- 

ing to  Guidesheet  i If l and  is  equal  to  s=6,3.  Records 
showed  this  was  nearly  constant  in  the  past  and  there 
was  no  reason  to  believe  it  would  not  stay  nearly 
constant. 

(jP) — The  TA  officer  chooses  a=.01,  thus  keeping  the  proba- 
bility of  a Type  I error  low.  That  is,  he  does  not 
care  to  report  a trend  when  there ' s a good  chance 
there  is  none. 

(T) From  the  table  in  Note  #4  we  get  the  table  value  of 

the  test-statistic  corresponding  to  a*. 01: 

2 (Table)=  2.33 

(^T) The  Lower  Control  Limit  (LCL)  is  computed  as  shown: 

LCL  - u-z (Table) •— ~ 

° /n 

= 90-2.33U~?J  = 90-2. 33 ( . 593)  = 38.618 


LCL  = 88.6 


(8) The  TA  officer  adds  the  LCL  to  the  chart  as  shown: 

Standard 
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_Q. 


LCL(a= . 05) 


X 


LCL (a-. 01) 


H 1 +■ 


H 1 1 1 1 1 (- 

8 9 10  il  12  13  14 


x.  (Time-Period) 

-If  a were  •‘not  so  small,  say  a=.05,  the  LCL  would  be 
89.0%.  One  would  be  less  sure,  however,  when  he 
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A ^ 


m 


M 


m 


says  there  is  a trend.  He  has  a a=,05  probability 
of  saying  there  is  a trend  when  there  actually  is 
none  (as  opposed  to  a probability  of  .01). 
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GUIDESHEET  #5.  Straight-Line  Least-Squares-Best-Fit . 


Assumption? : 


-This  method  is  to  be  used  when  the  time-plot  of  the 
data  suggests  a straight-line  trend. 


Steps : 


-Arrange  the  data  in  vertical  columns  (as  in  example) 
with  column  headings  as  follows.  Columns  k and  5 
will  be  filled  in  later. 


Period 

Name 

Period 

Number 

X. 

i 

Measurement 

*i 

Square 

X? 

X 

Product 

x-Yi 

J 

r ' 

J 

r ' 

1 

Sy-jT Exi*yi= 


-Count  the  total  number  of  observations  or  measure- 
ments that  are  to  be  tested  for  trend.  Call  this 
total  number  "n". 


-Compute  the  square  of  each  period  number.  That  is, 
compute  xi=xi‘xi  anci  record  each  value  in  the  desig- 
nated column. 


-Compute  the  product  of  each  period  number  and  meas- 
urement. That  is,  compute  the  product  x^y^  and 
record  in  the  designated  column. 


-Add  up  each  column.  That  is,  compute  the  sums:  Ex^, 

^i’  2xi’  an<^  2xi*^i  anc^  record  in  the  appropriate 
place  at  the  bottom  of  each  column. 


-Compute  the  Mean  of  x.^  and  Mean  of  y^  according  to 
the  following  formulas. 


n 


(1) 
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(2) 


" = fli 

^ n 


■Compute  the  quantity  S 

XX 


sxx = nK  - (K) 


•Compute  the  quantity  S . 


Sxy  = ^Vi  - dxi)  (Z7i) 


(3) 


(4) 


-Compute  "b",  the  "slope"  of  the  line-of-best-fit . 

b . !s . nKrj  - (K)  (Iy±) 

sxx  ' nlx|  - (Z^) 2 (5) 

-Compute  "a",  the  y-intercept  of  the  line-of-best-fit. 

a = y - bx  (6) 

-Write  the  algebraic  equation  of  the  line-of-best-fit. 

y a + bx  (7) 

Draw  the  line-of-best-fit  through  your  data  points. 
Two  points  allow  you  to  do  this.  At  year  0 (x=Q) 
the  equation  says  y=a  so  the  line  must  pass  through 
this  point.  Substitute  any  other  value  of  x (prefer- 
ably near  the  end  of  the  data)  and  plot  the  corre- 
sponding value  of  y.  Draw  a straight  line  through 
these  points. 

When  used  in  conj motion  with  a control  chart,  a 
trend  becomes  significant  when  the  trend  line  touches 
one  of  the  limits,  either  the  LCL  or  the  UCL.  See 
Guidesheet  #3  or  #4. 


If  the  slope  of  the  line,  b,  is  zero,  there  is  no 
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trend.  Tnis  would  be  a quick  check.  That  is,  the 
size  of  "b"  can  be  looked  at  as  an  indication  of 
the  magnitude  of  the  trend. 
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EXAMPLE  For  Straight-Line  Least-Squares-Best-Fit 

(Guide sheet  #5) 


Period 

Name 

Period 

Number 

X. 

l 

Measurement 

*i 

Square 

X? 

X 

Product 

x.  *y . 
i ;i 

Jan 

1 

97.6 

1 

97.6 

Feb 

2 

93.0 

4 

186.0 

Mar 

3 (1) 

91.0 

9 

273.0 

Apr 

4 (2) 

99.0 

16 

396.0 

May 

5 (3) 

99.5 

25 

497.5 

Jun 

6 (4) 

98.0 

3b 

588.0 

Jul 

7 (5) 

97.0 

49 

679.0 

Aug 

8 (6) 

96.0 

64 

768.0 

Sept 

9 (7) 

97.0 

81 

873.0 

Oct 

10  (8) 

98.0 

100 

980.0 

Nov 

11  (9) 

96.0 

121 

1056.0 

Dec 

12  (10) 

95.0 

144 

1140.0 

Ex. =78  Ey_.  = 1157.1 

1 — J i 


Ex?=650 


Ex.  *y  .= 

l 

7534.] 


— n=12.  Note:  If  you  wished  not  to  include  the  first 
two  measurements  in  this  test  for  trend  due 
to  their  position  in  the  time-line,  you  would 
have  to  re-number  both  time-line  and  table 
from  1 to  10  (shown  in  parentheses) . This  is 
not  done  here. 


■Compute  squares 


X1 

ii 

K 

t-1 

‘ *1  - 

x2 

= x2 

’ x2  = 

x3 

= x3 

* x3  = 

recorded  as  shown 


ii  ii 


x12  ' x12  ’ x12  " 12  ' 12  =144 


— Compute  products  x^*y^: 


X1 

1! 

H 

• 

i 

• 97.6 

x2 

* ^2  = 

2 

• 93.0 

x3 

* ^3  = 

• 

• 

3 

• 91.0 

• 

• 

x12 

• 

• yj9  - 

12 

• 

• 95.0 

(jP) Sum 

£ 


97.6  recorded  as  shown. 


- 186.0 

It 

tt 

It 

= 273.0 

tt 

tt 

tt 

• 

M 

tt 

It 

• 

tt 

tt 

tt 

• 

tt 

tt 

tt 

1140.0 

tt 

tt 

tt 

78  recorded  as  shown 


Sv  4 

5.0  = 1157.1  recorded  as  shown 

Sum  column  4. 

Ex?  = 1+4+9+16+ •• *+144  = 650.0  recorded  as  shown 
Sum  column  5 : 

Zx.y.  = 97 . 6+186+273+* • *+1140  = 7534.1  recorded 

i l 

as  shown. 

(^T) Compute  mean  of  and  y^: 


x 


Zxl  _ 78 
n 12 


6.5 


x = 6.5 


ns7  i 

y = — - ~ ±±1L±  = 96.425  y = 96.425 

n 12 

©—Compute  Sxx: 

= r‘Ixf  " 


(Tx,)2  = 12(650)- (78) 2 


( IT ) Compute  S : 


®- 


Sxy  “ nt\-y±  - (K)  (&i) 

= 12(7534.1)  - (78)(1157.1) 

= 90,409.2  - 90,253.8  = 155.4 
Sxy  =155.4 

-Compute  the  slope  "b" : 

b ■ “ 1,55.-...4  = .0906 

Sxx  ^716 


ay 

(S) 


b = .0906 

10) --Compute  "a",  the  y-intercept: 

a = y - bx  = 96.425  - (.0906)(6.5) 

= 96.425  - .588 
a = 95.8 

-Write  the  equation  of  the  line:  y = 95.8  + . 0906x^ 
-Plot  the  line : 

y=95. 8+.0906x^ 

If  x=0 , y=?5.8 
If  x=l0, y=95. 8+. 906 
y=96. 7 

Note:  If  "b"  had  been  a negative  number,  the  line  of  best- 
fit  would  have  had  a downward  slope. 

(l3) --If  for  some  given  a,  a UCL  was  computed  to  be  UCL= 
98.0,  and  plotted  as  shown,  one  could  see  the  trend 
line  d<.  es  r.ot  touch  for  many  time-periods . That  is , 
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there  would  be  plenty  of  chance  for  measurements  to 
reverse  direction  or  level  off.  This  illustrates 
Caution  #1,  page  50,  that  one  should  reserve  judge- 
ment unless  trends  reach  a limit  before  one  could 
observe  new  measurements. 


GUIDESHEET  #6.  Curvilinear  Least-Squares-Best-Fit 


Assumptions : 


-This  method  is  to  be  used  when  a time-plot  of  the 
daca,  suggests  a curved- line  trend. 


Steps : 


Arrange  the  data  in  vertical  columns  (as  in  example) 

with  column  headings  as  follows.  Columns  4 through 
will  be  filled  in  later. 
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Period 

Name 

Period 

# 

X. 

X 

Measure- 

ment 

yi 

Square 

X? 

1 

Cube 

X? 

X 

Fourth 

xlf 

1 

Product 

x.  *y. 

1 ;x 

Product 

xi*yi 

f * 

' * 

' t 

» * 

' > 

■ 

Syi 

Ex? 

Ex? 

Ex* 

n. 

EXi-yi 

Ex?-yt 

-Count  the  total  number  of  observations  or  measure- 
ments which  are  to  be  tested  for  trend.  Call  this 
total  number  "n". 


(7) Compute  the  square  of  each  period  number  x^  and 

record  these  x?  values  in  column  4. 

(4) Compute  the  cube  of  each  period  number  x^.  This  may- 

be done  by  multiplying  the  x?  values  in  column  4 by 
x • , xl=x?*x. . Record  each  x?  value  in  column  5. 

(7) Compute  the  fourth  power  of  each  x.. . This  may  be 

done  by  multiplying  the  x?  values  in  column  5 by  x^, 

or  x!f=xf*x..  Record  each  x!f  value  in  column  6. 

1 x 1 1 

(7) Multiply  each  x^  and  y^  and  record  each  product  in 

column  7. 

(7) Multiply  each  quantity  x^»y^  by  x^.  That  is,  compute 
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each  product  x**y. . Record  each  x?*y.  value  in 
i 1 i x 

column  8. 

-Sum  each  column  ax.,  Ey  . , Ex?,  Ex?,  Ex!?,  Ex. *y. , and 

1.  •*.  *l< 

Zxi*^i‘  ^ecor(^  these  sums  in  the  appropriate  place 
below  each  column. 

-Arrange  three  equations  as  shown  below.  The  values 
in  parentheses  have  been  computed  in  previous  steps 
and  are  the  "coefficients"  of  the  unknowns  a,  b,  and 
c.  We  will  be  solving  this  system  of  three  simultan- 
eous equations  for  the  three  unknowns  a,  b,  and  c. 
Number  the  equations  (1),  (2),  and  (3)  as  shown. 

(n)a  + (Zx)b  + (Ex2)c  = (Ey)  (1) 

(Ex)a  + (Ex2)b  + (Ex3)c  = (Exy)  (2) 

(Ex2)a  *f*  (Zx3)b  + (Ex4) c = (Ex2y)  (3) 

-Eliminate  the  unknown  "a"  between  equations  (1)  and 
(2)  See  Note  1.  To  do  this,  multiply  equation  (1) 
by  the  coefficient  of  "a"  in  equation  (2)  to  form 
equation  (4) . Multiply  equation  (2)  by  the  coef- 
ficient of  "a"  in  equation  (1)  to  form  equation  (5) . 
Subtract  equation  (5)  from  equation  (4)  to  form 
equation  (6),  with  the  unknown  "a"  eliminated. 

-Eliminate  the  same  unknown  "a"  between  equations  (1) 
and  (3)  by  the  same  method  as  Step  10.  Multiply  (1) 
by  the  coefficient  of  "a"  in  (3)  to  form  equation  (7) . 
Multiply  (3)  by  the  coefficient  of  "a"  in  (1)  to 
form  equation  (8) . Subtract  (8)  from  (7)  to  form 
equation  (9),  with  the  unknown  "a"  eliminated. 

-Eliminate  the  unknown  "b"  between  equations  (6)  and 
(9) . Multiply  (6)  by  the  coefficient  of  "b"  in  (9) 
to  form  equation  (10) . Multiply  (9)  by  the  coeffi- 
cient of  "b"  in  (6)  to  form  equation  (11) . Subtract 
(11)  from  (10)  to  form  equation  (12)  with  the  unknown 
"b"  eliminated. 
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— Equation  (12)  can  be  readily  solved  yielding  the 
correct  value  of  "c".  That  is,  dividing  the  number 
on  the  right-hand-side  of  the  equals  sign  in  equa~ 
tion  (12)  by  the  coefficient  of  "c",  yields  the 
value  "c". 

— Substitute  the  value  of  "c"  from  Step  13  into  equa- 
tion (6)  and  solve  for  the  unknown  "b"  such  as  was 
done  in  Step  13 . 

--Substitute  the  values  of  "b"  and  "c"  into  equation 
(1)  and  solve  for  the  unknown  "a". 

— Substitute  all  three  values  "a",  "b",  and  "cM  into 
either  equation  (2)  or  (3)  to  check  the  answers.  If 
the  equation  does  not  yield  an  equality,  go  back  and 
check  each  step.  This  completes  the  solution  of  the 
set  of  simultaneous  equations. 

--The  equation  of  the  least-squares-best-fit  parabola 
can  be  written  with  the  values  "a",  "b",  and  "c" 
from  earlier  steps  as  follows: 

y = a + b»x  + c*x2  (13) 

— To  draw  the  curve  of  best- fit  one  will  have  to  plot 
several  of  the  points  of  the  parabola  described  by 
equation  (13) . Substitution  of  several  integer 
values  of  x^(x.®l,  2,  3,  etc.)  will  yield  several 
values  of  y^.  When  these  are  plotted,  a French  Curve 
may  be  used  to  draw  a line  connecting  them.  One  may 
wish  to  use  a different  color  of  pencil  for  the 
points  and  line  if  plotting  on  the  same  time-plot 
that  contains  the  original  data. 

--The  trend  described  by  equation  (13)  becomes  signifi- 
cant at  the  a- level  of  significance  (see  Step  6, 
Guidesheet  #3  or  #4)  when  it  touches  the  appropriate 
Control-Chart  limit  (UCL  or  LCL) . 
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Notes : 


1 j — Sometimes  it  is  apparent  by  inspection  which  unknowns 
should  be  eliminated  first,  making  the  numbers  work 
out  better.  Coefficients  being  multiples  of  each 
other  or  even  negatives  of  each  other  simplify  elim- 
ination. A little  practice  solving  sets  of  equations 
provide  one  with  the  ability  to  see  such  short-cuts 
but  may  not  be  assumed  common  knowledge  in  this 
general  solution  guide. 


~T/Jhen  a new  measurement  is  taken,  for  the  next  time- 
period,  a new  equation  describing  the  trend  must  be 
computed.  That  is,  all  the  steps  must  be  followed 
again.  This  is  not  necessary,  however,  if  the  new 
measurement  falls  on  or  very  close  to  the  trend  line. 
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EXAMPLE  for  Curvilinear  Least-Squares-Best-Fit 

(Guidesheet  #6) 


Steps : 

fl)--“>The  data  is  arranged  as  shown: 


Period 

Name 

(Kalf-Yr) 

Jul-Dec 

’71 

1st 

'72 

2nd 

'72 

1st 

'73 

2nd 

’73 

1st 

’74 

2nd 

'74 

1st 

'75 

2nd 

'75 

i 1st 

'76 

2nd 

'76 

1st 

'77 

| 2nd 

'll 

1st 

'78 

10  5 

11  6 

12  7 

13  8 

14  9 


iXi  ^i 


Meas- 

ure- 

ment 

y. 

J X 

Square 

X? 

i 

Cube 

X? 

l 

Fourth 

xi! 

i 

18.70 

14.90 

6.70 

5.73 

9.35 

21.20 

1 

1 

1 

20.60 

4 

8 

16 

23.30 

9 

27 

81 

23.30 

16 

64 

25.6 

27.40 

25 

125 

625 

30.20 

36 

216 

1296 

28.90 

49 

343 

2401 

29.60 

64 

512 

4096 

29. 0C 

81 

729 

6561 

x4  *y* 


21.2 

41.2 
69.9 

93.2 

137.0 

181.2 

202.3 
236.8 

261.0 


372.8 

685.0 

1087.2 

1461.1 

1894.4 

2349.0 


<ih- 


©- 


=45  =233.5  =285  -2025  =15333  =1243.8  =8117.8 

•-It  was  determined  that  not  all  the  above  data  should 
enter  into  the  test  for  trend  due  to  a "rules  change" 
in  the  last  half  of  1973  (x^=5) . The  table  above  and 
time-plot  is  renumbered  accordingly. 

•-Compute  each  square  x£: 


x2  = x *x  = 1*1  = 


x£  = X£*X2  = 2*2  = 
x|  = Xg*Xg  = 3*3  = 


xi  = x^*x«  = 9*9 


Record  in  Column  #4 


(T) — Compute  each  cube  x^- 


y 3 = 

X1 

xX*xl  “ 

1*1  = 

1 

Record 

in 

Column  # 

x|  = 

x2*x2  = 

4*2  = 

8 

tt 

tt 

1 1 it 

• 

• 

• 

• 

tt 

h 

tt  tt 

• 

• 

• 

• 

It 

n 

tt  tt 

• 

• 

• 

• 

tt 

tt 

It  tt 

3 — 

Ol  _ A _ 

-70  A 

tt 

it 

tf  tt 

9 9 


(^5) Compute  the  fourth  power  x£: 


x 


4 „ 3 


X1*X1  = = 


x2  = x2*x2 


8*2 


1 Record  in  Column  #6 
16 


Xg  = Xg*Xg  = 729*9  = 6561 


ft 

ft 

ft 


©---' 


x. 


x 


2*y2 


x( 


9*y9 


©---< 


x; 


x2  *y2 


the  product 

Vyi: 

» 1*21.2  = 

21.2  Record  in  Column  #7 

= 2*20.6  = 

41.2 

ft  tf 

tt  It 

• 

• 

tt  ft 

It  tt 

• 

• 

tt  It 

It  tt 

• 

• 

tt  tf 

tt  tt 

- 9*29.0  = 

261.0 

It  It 

tt  It 

the  product 

xi*yi: 

= 1*21.2  = 

21.2 

Record  in 

Column  #8 

= 4*20.6  = 

82.4 

tt  tf 

tt  It 

x, 


9’y9 


= 81*29.0  = 2349.0 


tt 

tt 

tf 


©- 


Sum  the  x^'s 


Zx^  = x^+X2-I"  * *+Xg  = l+2+***+9  = 45 


Record  in  Column  #2 
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Sum  the  y^’s: 

Zyi  = y]+y2+# ’ "tyg  = 21.2+20.6+** *+29.0  = 233 , 5 

Record  in  Column  #2 

Sum  the  remaining  columns  and  record  above. 

Ex?  = x^+x^i-*  * *+2Lg  = 1+4+**  *+81  = 285 

Ex*  - xi+x2+'“+x9  = 1+8+**  *+729  = 1025 

Ex£  = xj+xj+*  • «+x^  = 1+16+**  *+6561  - 15,333 

^xi'^i  = xl^l+x2^2+* * *+x9^9  = 21.2+41.2+* • *+261.0 

=1243.8 

Ex?  y ^ = x^y1+X2y2+*  * *+Xqyg  = 21. 2+82. 4+“  *+2349 

= 8117.8 


- 9a  + 45b  + 285c  = 233.5  (1) 

45a  + 285b  + 2025c  = 1243.8  (2) 

285a  + 2025b  + 15,333c  = 8117.8  (3) 

(10)  --Multiply  (1)  by  45: 

405a  + 2025b  + 12825c  = 10507.5  (4) 

Multiply  (2)  by  9: 

405a  + 2565b  + 18225c  = 11194.2  (5) 

Subtract  (5)  from  (4)  to  get: 

0 - 540b  - 5400c  =-686.7  (6) 

(11)  --Multiply  (1)  by  285: 

2565  +12825b  +81,225  = 66,547.5  (7) 

Multiply  (3)  by  9: 

2565  + 18225b  +137,997  = 73,060.2  (8) 

Subtract.  (8)  from  (7)  to  get: 

0 - 5400b  -56,772  = -6512.7  (9) 
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i. 

L 


f 

{ 


(l2)  — Note  that  the  coefficient  of  b in  Eq  (9)  is  a multi- 
ple of  the  coefficient  in  (6) . We  may  take  advantage 
of  this  and  come  up  with  Eq  (12)  much  easier.  Then 
we  will  do  it  the  long  way  (below)  to  show  both  are 
equivalent . 

Multiply  (6)  by  3.0  to  form  (10) : 

-5400b  - 54,000c  = -6867  (10) 

Recopy  (9):  -5400b  - 56,772c  = -6512.7  (11) 

Subtract:  0 + 2772c  = -354.3  (12) 

11  from  10 

2772c  - -354.3  (12) 

Solving  for  c,  we  get: 


„ _ -354.3  _ 100 

0 -W'128 

c = -.128 


13)  - 


-Longer  Method:  If  it  is  not  recognized  that  coeffi 
cients  of  the  unknowns  are  multiples,  we  have: 

Multiply  (6)  by  5400: 

2,916,000b  - 29,160,000c  = -3,708,180 

Multiply  (7)  by  540: 

2,916,000b  - 30,656,880c  = -3,516,858 

Subtract  (11)  from  (10)  to  get: 

1,495,880c  « -191,322 

-Solving  for  c,  we  get: 


c = -191,322  =_ 

1,495,800  ’ b 


(10) 

(31) 

(12) 


c = -.128 


Same  Answer. 
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(l4) --Substitute  c=-.128  into  Eq  (6)  and  solve  for  ”bM: 

-540b  - 5400c  = -686.7  (6) 

or  -540b  - 5400 (-.128)  = -686.7 
-540b  + 691.2  = -686.7 
-540b  = -686.7  - 691.2  =-1377.9 


b = 

b = 

--Substitute  c: 
solve  for  "a” 


zigLl=2.55 

-540 

2.55 

-.128  and  b=2.55 


into  Eq  (1)  and 


9a  + 45b  + 285c  = 233.5  (1) 

9a  + 45(2.55)  + 285  (-.128)  = 233.5 
9a  = 233.5  + 36.48  - 114.75  = 155.23 


a = 


loo.23 


= 17.2478 


©- 


i - 


a = 17,25 

-Substitute  these  values  of  a,  b,  and  c into  Eq  (2) 
to  check  the  computations : 

45a  + 285b  + 2025c  = 1243.8  (2) 

45(17.25)  + 285(2.55)  + 2025(-.128)=  1243.8 

776.25  + 726.75  - 259.2  = 1243.8 

1243.8  = 1243.8 

Yes,  exactly.  Computations  are  good. 

-The  equation  of  best-fit  curve  is  as  follows: 
y = 17.25  + 2. 55x  - .128x2 
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(Measurements  ) 


Plotting  points  for  x = 1,  3,  5,  and  7 should  give 
us  enough  to  draw  the  curve  as  shown  below; 

If  x— 1 , y = 17.25  + 2.55(1)  - .128(12)  = 19.67 

x=3,  y = 17.25  + 2.55(3)  - .128(32)  = 23.75 

x=5 , y = 17.25  + 2.55(5)  - .128(52)  = 26.8 

x=7 , y = 17.25  + 2.55(7)  - .128(72)  = 28.83 


- — - -Q-  ■O--’  ^ —A — 


7 8 9 10 

x^  (Time-Period) 

One  may  either  extend  the  curve  to  see  if  trend  is 
significant,  or  substitute  future  time-periods 
(forecast).  That  is,  if  x=9  is  the  present  time- 
period,  substitute  x=ll  and  plot  the  point  in  the 
time-plot  above : 

If  x— 11 , y = 17.25  + 2.55(11)  - .128(121)  = 29.8 

If  a standard  and  UCL=29.8  had  been  established,  as 
shown  on  graph,  the  trend  would  be  significant.  How- 
ever, if  a new  measurement,  say  at  x=!0,  produced  a 
more  "level"  curve  (after  computation  and  plotting) , 
this  curve  could  very  well  not  touch  the  UCL  at  x=ll 
and  the  new  curve  (trend)  would  not  be  significant. 
Note  that  the  value  30.2  is  above  29.8.  Obviously  the 
trend  was  significant  at  that  point,  but  not  necessar- 
ily after  lower  measurements  came  in. 


GUIDESHEET  #7.  The  Single  Moving  Average 


Assumptions : 


— This  method  is  used  where  there  is  very  little  trend 
apparent  from  the  time-plot,  but  one  wishes  to 
"smooth"  random  fluctuations  in  the  past  data.  It  is 
also  a lead-in  to  more  descriptive  methods. 


Steps : 


— Arrange  the  data  in  vertical  columns  (as  in  example) 
with  column  headings  as  follows . Column  4 will  be 
filled  in  later. 


©- 

©- 


Period 

Period 

Measurement 

Single  Moving 

Name 

Number 

Average 

x. 

yi 

M. 

i 

i 

! 

i 

i 

— Choose  "n",  the  total  number  of  observations  or 
measurements  which  are  to  be  tested  for  trend. 

--Choose  "N",  the  number  of  previous  points  you  wish 
to  use  in  the  computation  of  the  Single  Moving 
Average  (SMA) . If  there  are  less  than  15  total  obser- 
vations, use  N=5,  if  there  are  more,  one  may  use  more 
than  5,  but  more  than  N=10  becomes  unwieldy. 


Compute  the  first  N-period  moving  average.  It  is  the 
simple  average  of  the  first  N measurements.  Use  the 
following  formula.  Record  this  value  in  column  4. 


M. 


l 


!l±  = yl+y2+’  • ,+yN 
N N 


(1) 


Once  the  first  M^  has  been  computed,  an  easier  formula 
may  be  used  for  values  up  to  the  present.  Record  each 
in  column  4. 
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(2) 


~ yl-N 
N 


-Although  they  may  not  be  needed,  computing  moving 
averages  up  to  the  last  known  value  of  y "smooths" 
the  data.  Plotting  these  points  and  connecting  with 
short  lines  will  show  trends  which  appeared  in  Che 
past. 

-Either  superimpose  the  data  and  connecting  lines  onto 
a Control  Chart  (Guidesheet  #3  or  #4)  or  superimpose 
a Control  Chart  onto  the  graph  completed  in  this 
Guidesheet.  If  preparing  charts  for  a briefing,  it 
may  be  convenient  to  use  overlay  transparencies. 

-The  Control  Chart  limits  (whether  LCL  or  UCL  or  both) 
should  be  considered  the  point  where  the  trend 
becomes  significant.  Future  values  are  constant  and 
equal  to  the  latest  moving  average. 
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EXAMPLE  for  Single  Moving  Average  (Guidesheet  #7) 


1 

Period 

Name 

(Flight  Date) 

3 

Dec 

8 

Jan 

2 

Feb 

15 

Mar 

1 

Apr 

28 

May 

25 

c-i 

19 

Jul 

4 

Aug 

8 

Sep 

5 

Oct 

18 

Nov 

12 

Dec 

12 

Jan 

Period  I Measurement!  N-Penod 
Number  I 1 M 

Average 


18.7 

14.9 
6.07 
5.73 
9.35 

21.2 

20.6 

23.3 

23.3 

27.4 
30.2 

28.9 
29.6 
29.0 


-We  wish  to  test  all  the  data,  so  n=14 
-Since  n=14<15,  choose  N=5. 

-Compute  the  first  5-period  moving  average 


M5  “ 


i-i  7i  _ 

N N 


18.7+14.9+6.07+5.73+9.35  = 54J 
5 5 


M5  - 10.95 


(T) — Compute  Mg 


„ _ „ , y6  - y6-5  _ „ , y6  " yl 

M^  “ M/-  n + = = Mc  + 


M , = 10.95  + -2---2--1--7-  = 10.95  + 
o 5 5 

= 10.95  + .5  = 11.45 
Mg  = 11.45  Record  Mg  in  Column  4. 

Compute  moving  averages  up  to  M.^,  using  formula  (1) 

„ „ , y?  - y?-5  _ w , y?  ■ ^2 

M7  = M7-l  + 5 = M6  + 5 


= 11.45  + = 11.45  + 1.14  = 12.59 


M?  = 12.59 


Record  in  Column  4. 


I ji  . 

I'll 

'll 


i 


Mg  through  M^  are  computed  and  recorded  as  shown 
above. 

©—Plot  the  data  and  all  moving  averages 


All  future  (forecast)  values  are  equal  to  M^  until 
a new  measurement  comes  in. 

(T) — The  time-plot  above  shows  how  the  data  may  appear  in 
conjunction  with  a standard  and  UCL  (Guidesheet  #3 
or  #A) . 
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— Since  all  forecast  "values  for  this  method  are  equal 
to  the  latest  moving  average  the  above  plot  of 
moving  averages  "levels”  off.  Therefore,  it  would 
be  hard  to  say  this  trend  is  significant  unless  a 
point  had  already  touched  the  UCL. 


GUIDE SHEET  #8.  The  Double  Moving  Average 

Assumptions : 

@ — This  method  is  to  be  used  where  the  time-plot  cf  the 
data  suggests  a straight-line  trend,  but  one  wishes 
not  to  have  to  compute  a new  straight- line  equation 
each  time  a new  measurement  is  taken. 

Steps : 

(T) — Arrange  the  data  in  vertical  columne  (as  in  example) 
with  column  headings  as  follows . Columns  4 and  5 
will  be  filled  in  later. 


Name  I Number 


Measurement 

Single 

Double 

Moving  Avg. 

Moving  Avg. 

?i 

M&l 

M&] 

©- 


-Choose  n the  total  number  of  measurements  that  are 
to  be  tested  for  trend. 

-Choose  "N",  the  number  of  previous  points  you  with  to 
use  in  the  computation  of  the  Single  Moving  Average 
and  the  Double  Moving  Average. 

-Compute  the  first  N-period  Single  Moving  Average 
according  to  the  following  formula. 

sy.j  yi+y2+  +yN 


This  first  goes  in  the  Nth  space  down  the  table, 
column  #4. 

Once  the  first  M?  has  been  computed,  an  easier  form 
ula  may  be  used  for  subsequent  values.  These  are 
then  recorded  in  the  table. 


(2) 


N&3  = ^1_1  + 


yi  ' yi-N 


N 


-Compute  the  first  N-period  Double  Moving  Average  M? 
by  averaging  the  first  N Single  Moving  Averages 
according  to  the  following  formula.  This  goes  in 
the  N+Nth  space,  column  #5. 


£1$  mW+m|4-  • 

i N N 


Once  the  first  Double  Moving  average  has  been  com- 
puted an  easier  formula  may  be  used  for  subsequent 
values.  These  are  then  recorded  in  the  table. 


N 


(4) 


-Compute  the  quantity  a^,  a quantity  analogous  to  the 
y-intercept  when  finding  the  line-of-best-fit  in 
Guidesheet  #5. 


a.  - 21$  - (5) 

xii 

-Compute  the  quantity  b.,  a quantity  analogous  to  the 
slope  in  Guidesheet  #5. 


bi  = 


2 1 


N-l 


(M^1  - 


(6) 


-Compute  the  forecast  for  T periods  into  the  future, 
using  the  following  formula,  which  is  much  like  the 
equation  of  the  straight  line  developed  in  Guide- 
sheet  #5. 


yi+T  " ai  + bi’T 

Where  i is  the  present  period  number  and  T is  the 
number  of  periods  ahead  of  i you  wish  to  forecase. 
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Neither  moving  average  need  be  plotted  in  this  case, 
particularly  if  one  forecast  is  all  one  is  inter- 
ested in.  If  this  chart  is  to  be  maintained  over 
time,  however,  the  forecasts  should  be  plotted  as 
they  are  computed. 

For  trending,  either  superimpose  the  data  and  connect 
ing  lines  onto  a control  chart  (Guidesheet  #3)  or 
superimpose  a control  chart  onto  the  graph  completed 
in  this  guidesheet.  If  preparing  charts  for  a brief- 
ing, it  may  be  convenient  to  use  overlay  trans- 
parencies . 

The  control  chart  limits  (LCL,  UCL,  or  both)  should 
be  considered  the  point  where  the  trend  becomes 
significant.  That  is,  near-term  forecasts  can  be 
made  and  plotted.  If  one  touches  an  established  LCL 
or  UCL,  the  trend  is  significant. 


When  a new  measurement  is  taken,  a new  forecast  equa- 
tion, Eq  (7)  must  be  computed. 


EXAMPLE  for  Double  Moving  Average  (Guidesheet  #8) 


Number 

x. 


15 

Mar 

1 

Apr 

28 

May 

25 

Jun 

19 

Jul 

4 

Aug 

8 

Sep 

5 

Oct 

18 

Nov 

12 

Dec 

12 

Jan 

Measurement 

?i 

Single 
Moving  Avg. 

Double 
Moving  Avg. 

18.7 

14.9 

6.07 

5.73 

9.35 

10.95 

21.2 

11.45 

20.6 

12.59 

23.3 

16.04 

23.3 

19.55 

14.12 

27.4 

23.16 

16.56 

30.2 

24.96 

19.26 

28.9 

26.62 

22.07 

29.6 

27.88 

24.44 

29.0 

29.02 

26 . 33 

12+ 


--We  wish  to  use  all  the  data  for  the  test  so  n~14. 


Choose  N=5,  the  number  of  past  data  points  used  to 

predict  future  points . 

(J) Compute  the  first  5 -period  Single  Moving  Average 

~ (SMA) . 

5 

Jiyi  _ 


18.7+14.9+6.07+5.73+9.35 


M&]  10.95 


Now  compute  the  remaining  9 SMA’s  using  formula  (2) 


tiS  =.  wfii , + 7l.  y.t-g  = M , + lkU±l 
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16.56 


Continue  computations  of  DMA's,  record,  and  check 
answers  above . 

( IT ) — Compute  the  quantity  a^  = 2M^  - 1$ 

ai4  = 21^4  - - 2(29.02  - 26.33  = 


= 58.04  - 26.33  = 31.71 
a^4  = 31,71 

9) Compute  the  quantity  b.  = (if?  - 1$) 

b14  ■ <rf14  ’ rfj?4>  * |(29-02  - 26.33) 

- |(2.69)  = 1.34 

4> 

b14‘i^ 


10) — Any  forecast  T-periods  in 
using  Eq  (7) : 


the  future  nay  be  computed 


y±+T  = ajL  + t^T 

If  the  present  period  is  i=14,  and  we  wish  to  fore- 
cast T=1  period  into  the  future,  we  have: 

y14+l  = a14+b14*1  = 31*71  + 1-34*1 

= 31.71  + 1.34  - 33.05 
yq5  = 33.05 

If  we  wish  to  forecast  T=3  periods  into  the  future 
we  have: 

y14H-3  ” a14  + b14T  = a14  + b14*3 

=31.71  + (1.34)(3)  = 31.71  h 4.02  = 35.7 
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717  = 3LZ 

(ll)  & (l2) — If  there  were  a UCL  established  (Guidesheet 
#3  or  #4)  at  y=35.0,  this  trend  would  be  considered 
significant.  When  the  new  measurement  comes  in 
new  values  are  used  to  write  a new  Eq  (7) . See  time- 
plot  below. 

If  y15=27 .0, 


^15  = ^?4  + ~~  — = 29.02  + 27  • ? 

= 29.02  + y « 29.02  - .4  = 28.62 
M&l  = 28.62 


Record  in  table  and  use  in  next  computation. 

= 26.33  + j 23J-6 

= 26.33  + « 26.33  +1.09  = 27.42 

^15  - 27.42 

Record  in  table  and  use  in  next  computation. 


a15 

a15 


2jff5  - M|]5  = 2(28.62)  - 27.42 

57.24  - 27.42  = 29.82 
29.82 

0^5  - >f5)  = 1(28. 62-27. 42)  = 0.6 


bi5  - Li. 
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GUIDESHEET  #9.  Triple  Exponential  Smoothing 
Assumptions : 

© This  method  is  to  be  used  when  a time-plot  of  the 

data  suggests  a curved- line  trend,  but  one  wishes  not 
to  have  to  compute  a new  curved- line  equation  each 
time  a new  measurement  is  taken 


Steps : 

© Go  directly  to  Guidesheet  #6,  Curvilinear  Least- 

Squares-Best-Fit , and  follow  Step  2 through  Step  16. 
At  this  point  (Step  16)  values  will  have  been  com- 
puted for  unknowns  called  "a",  "b",  and  "c".  Return 
to  this  Guidesheet  (Step  2)  with  those  values. 

(^) Write  down  the  values  of  a,  b,  and  c computed  in 

Guidesheet  #6. 

©— Ar;  range  data  in  vertical  columns  (as  in  example) 

with  column  headings  as  follows.  A new  line  for  i=0 
is  added  at  the  top  for  later  use.  Columns  1,  2, 
and  3 will  be  identical  to  those  filled  out  for 
Step  1,  Guidesheet  #6.  Columns  4,  5,  and  6 will  be 
filled  in  later . 


Period 

Period 

Measurement 

Name 

Number 

xi 

*i 

) — Choose  the  value  of  r , the  smoothing  constant.  I 
has  been  found  that  if  an  N of  5 gives  good  perform 
ance  with  the  double  moving  average,  an  gives 

similar  performance  (Ref  8:92).  This  would  give  us 
r=.33.  To  make  it  easier  to  work  with,  a good 
starting  value  for  our  purposes  is  r=.3. 

) — Compute  the  initial  estimates  for  i=0  using  the 
values  of  a,  b,  and  c from  Step  2 above.  Use 


5T:< 


eV- 


..•a 


h*a 


equations  (1)  , (2)  , and  (3)  below  to  compute  S®,  S^*, 

and  S*. 
o 


' ■ 


' I1?)*  * [ 


(1-r) (2-r) 


= a - f2ikSl]b  + f2d-r)  (3-2r) 


SB1  . a - f3.feS>.]b  ttzM 


■ — For  r=.3,  the  equations  can  be  written  as  follows. 

Sg  = a - 2.33b  + 13.2c  (4) 

Sg  = a - 4.67b  + 37.3c  (5) 

S®  - a - 7b  + 72.3c  (6) 

For  other  equations  for  different  values  of  "r",  see 
Note  #1. 

■ — Compute  the  single,  double,  and  triple  exponentially 
smoothed  statistics  for  time  i=i  using  the  following 
formulas.  Record  these  values  in  Columns  4,  5,  and  6, 


s¥  - -3yx  + 

^ = . 3S®  + . 7S® 
Sf  = . 3S^  + . 7S® 


■-Continue  computing  the  exponentially  smoothed  statis- 
tics for  i=2  through  the  present  time-period  using 
formulas  (10),  (11),  and  (12)  shown  here.  Record 
each  statistic  in  the  appropriate  columns. 
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(lO) — Write  the  forecasting  equation  for  period 
coefficients  from  Step  9 as  follows . 


"i"  with 


yi+T 


ai 


V . fp 

ui* 


- .T2 

"i  * 


(16) 


Where  T denotes  the  number  of  periods  in  the  future 
one  wishes  to  forecast.  That  is,  the  forecast  for 
T=1  period  in  the  future  would  be: 


yi+T  = yi+l 


— a.  + b . + c . 


(17) 


llj --Whenever  a new  data-point  must  be  computed,  new 

statistics  S$,  S$,  and  S$  must  be  computed.  These 
are  computed  by  formulas  (10) , (11) , and  (12)  from 
Step  8.  These  are  recorded  in  the  table  and  used  in 
Step  12. 

Using  the  new  statistics  from  Step  11  and  formulas 
(13),  (14),  and  (15)  from  Step  9.  we  compute  new 
coefficients  a^,  b_. 


12  — 


and  c^. 


13  - 


With  the  new  coefficients  a^,  b^,  and  c^  computed  in 
Step  12,  we  may  write  the  new  forecasting  formula. 


if 
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(18) 


yi+T  _ ai  + VT  + V1* 


Again,  T is  the  number  of  periods  in  the  future  a 
forecast  is  needed. 

(l4) — When  used  in  conjunction  with  a control  chart,  one 
may  say  a trend  is  significant  at  the  a level  of 
significance  (see  Guidesheet  #3  or  #4)  when  a fore- 
cast touches  the  appropriate  control  chart  limit. 


Notes : 

© — For  a smoothing  constant  of  r=.l  instead  of  r=.3  our 
equations  (4) , (5) , (6) , (10)  through  (15)  are  as 
follows : 


S“  - a - 9b  + 17c 

(4) 

S®  = a - 18b  + 504c 

0 

(5) 

Sg*  = a - 27b  + 999c 

(6) 

SP1=  .ly.  + •9S^_1 

(10) 

sf  = . isf  + .9S^_1 

(ID 

S®  - .1S|1  + .9S^_1 

(12) 

ai  = + S?  (no  change) 

(13) 

b±  = . 062  (5 .5S|3  - 9 . + 3 . 7S&) 

(14) 

ci  = .0062(3^  - 2S®  + S$) 

(15) 

If  it  is  necessary  to  use  other  values  of  r,  the  main 
text  of  the  thesis  must  be  consulted  for  the  general 
formulas . 
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EXAMPLE  for  Triple  Exponential  Smoothing  (Guidesheet  #9) 


Steps : 


© 


1 J — We  will  use  the  same  data  as  the  example  for  Guide- 
sheet  #6. 


a-17.25,  b=2.55,  c=-.128 
(T) Table  as  shown: 


Period 

Name 

Half-Yr 

Period 

Number 

X. 

1 

Meas- 

urement 

*i 

& 

X 

sta 

1 

s® 

X 

0 

9.62 

0.567 

-9.85 

1st  '74 

1 

21.2 

13.09 

4.33 

-5.60 

2nd  '74 

2 

20.6 

15.34 

7.63 

-1.63 

1st  ' 75 

3 

23.3 

17.73 

10.66 

2.05 

2nd  '75 

4 

23.3 

19.40 

13.28 

5.42 

1st  *76 

5 

27.4 

21.80 

15 . 84 

8.55 

2nd  '76 

6 

30.2 

24.32 

18.38 

11.50 

1st  '77 

7 

28.9 

25.69 

20.57 

14.22 

2nd  '77 

8 

29.6 

26.86 

22.46 

16.69 

1st  '78 

9 

29.0 

27.50 

23.97 

18.87 

©- 

©- 

©-■ 


-We  choose  an  "r" : r=0.3 

-Skip  to  the  worked-out  formulas  from  Step  6. 

-Compute  the  estimates  for  the  initial  exponentially 
smoothed  statistics  according  to  equations  (A-) , (5) 
and  (6) . 


Sg  = a-2.33b+13.2c  = 17. 25-2. 33(2. 55)+13.2C-. 128) 

= 17.25  -5.94  - 1.69  = 9.62 

SM  =9.62 
o 

Record  this  value  above,  across  from  x.j=0  in  Column 


Sg  = a-4. 67b+37. 3c  = 17.25-4.67(2.55)+37.3(-.128) 
= 17.25  - 11.91  - 4.77  = 0.567 
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A 
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-We  may  now  write  our  forecasting  equation  as  Eq  (16) 
shows : 


?i+T 


a.  + b. «T  + c.  *T2 
11  i 


or,  at  the  p_.csent  time  (i=9)  we  have: 
y9+T  = a9  + b9-T  +c9*T’- 

16  we  wish  to  forecast  1 period  beyond  i=9,  T-l, 
ai’  ^i»  an^  cd  are  ^rom  Step  9 and  we  have: 


y9+1  = 29.46  + .023(1)  - .144(12) 


y1Q  = 29.46  + .023  - .144  = 29.339  or  29.3 


If  we  wished  to  forecast  3 periods,  we  would  have: 
y9+3  - y12  = 29. 46+. 023(3)+. 144(32) 

= 29. 46+.069-. 144(9)  =28.2 


-Say  the  next  period's  measurement  comes  in,  i.e. 
y 10=2 7.0.  We  must  first  compute  the  statistics 

S^q,  and  S^q.  We  again  use  equations  (10),  (11), 

and  (12) : 

.3(27.0)+. 7(27. 5)  = 27.35 

.3(27. 35)+. 7(23. 97)  = 24.98 

.3(24.98)+. 7(18.87)  = 20.70 

coefficients  for  our  new  fore- 
casting equation  using  equations  (13),  (14),  and  (15) 

a10  = 3 S^q  - 3 S^q+S^o  = 3(27. 35)-3(24.98)+20.7 
= 27.8 


sTo  - • 3yio+-  7S?  ■ 

S10  = -3S?0+-7d;l  - 

sfo  = • 3S¥o+- 7Sf  " 

-We  may  now  compute  the 
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p 


& 


I 


f 

£ 


1 

f 


b10  - .3X(4.5sM0-7.6sf0+3.1sM) 

= . 31 (4. 5(27. 35) -7. 6(24. 98)+3. 1(20. 7)) 

= .3K-2.60)  = - ■ 807 

cio  = • 092 (sio'2sio+sfo^ 

= .092(27. 35-2(24. 98)+20. 7)  =.092(-1.91) 

- -.176 

13) — With  these  new  values  of  a,  b,  and  c we  may  write  a 
new  forecasting  equation: 

y10+T  = a10  + b10*T  + C10*T2 
y10-r-T  = 27 * 8“* 807 --176-T2 


14  - 


-The  data  is  plotted  as  follows.  If  there  were  a 
standard  as  shown  and  a UCL  at  y=35.0,  a trend  would 
be  significant  (at  chosen  a)  if  a near-term  fore- 
cast reached  the  UCL.  In  our  example  it  does  not. 


Note  that  the  forecast  for  y-^  in  Step  10  is  28.2. 
The  forecast  for  y ^ using  the  new  equation  in  Step 
13  is  y12=25 . 5 . This  shows  the  effect  of  a new  data 
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GUIDESHEET  #10,  Parametric  Test  for  Correlation 
Assumptions : 

@ — It  is  assumed  that  there  are  2 types  of  measurements 
here,  and  that  one  is  interested  in  whether  there  is 
positive,  negative,  or  zero  relationship  between 
them.  This  is  not  a trend  test. 


Steps : 

0 — Arrange  the  data  in  vertical  columns  (as  in  example) 
with  column  headings  as  follows . Columns  3 through 
6 will  be  filled  in  later. 


Measurement 

#1 

Measurement 

#2 

Square 

Square 

Product 

X. 

l 

*i 

y • 

x.  »y . 
l Jx 

J 

r J 

' > 

- j 

“I 

* 

1 

1 

■h 

Sy.= 

Sx|* 

zyj- 

sVi" — 

©~ 

®~ 


-Plot  the  measurements  jointly,  one  axis  for  each 
measurement.  Do  not  connect  any  of  the  plotted 
points  with  lines . 

-If  it  will  simplify  computation,  the  measurements 
may  be  divided  by  10 , 100 , or  whatever . If  there 
are  more  than  three  significant  digits,  also,  round 
off  to  three.  These  two  things  will  not  affect  the 
test  significantly  and  will  prevent  working  with 
extremely  large  numbers.  During  computation,  how- 
ever, carry  as  many  significant  figures  as  possible 
Likewise,  if  the  numbers  are  very  small  (.000236  for 
example)  and  vary  in  the  fourth  to  sixth  decimal 
place,  say,  multiply  by  a large  number  (1000  in  this 
case)  to  make  the  numbers  manageable. 

Compute  the  square  of  each  x^  measurement.  That  is, 

compute  x?=  xi*xi  atl^  record  in  the  designated  column. 
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•Compute  the  square  of  each  y^  measurement.  That  is, 
compute  y^y^y^  an^  record  in  the  designated  column. 

•Compute  the  product  of  each  x^  times  each  y^.  That 
is,  compute  the  product  x^*y^,  and  record  in  the 
last  column. 

Add  up  each  column  to  compute  the  following  sums : 

“xi’  ^iJ  2xi’  2^i’  Sxi'^i*  ^-ecord  each  total  in  the 
designated  place  at  the  bottom  of  each  column. 

Compute  the  quantity  of  S according  to  the  follow- 
ing  formula,  where  "n"  is  the  number  of  measurements. 


S = nTx?  - f 7 x.) 
xx  L i 1 4 xJ 


(1) 


Compute  the  quantities  and  Sxy  according  to  the 
following  formulas. 


10)  — 


Syy  “ n^yi  ' 2 


sxy  = nK‘yi  ' 


Compute  the  Sample  Correlation  Coefficient  "r", 
according  to  the  following  formula. 


(2) 

(3) 


r = 


xy 


/s  • S 


xx  yy 


(11)- 

©- 

©- 


(4) 


-Choose  a significance  level  a. 

-Look  up  the  value  of  the  z- statistic  corresponding  to 
the  a/2  picked  in  Step  11  from  the  table  in  Note  3. 
This  is  the  table  value  of  the  test  statistic  z (Table), 

-Compute  the  value  of  Z,  an  intermediate  statistic, 
according  to  the  following  formula. 

fl+r 

1-r. 


Z = %ln 


(5) 
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If  calculator  does  not  have  the  In  function,  the 
table  in  Note  4 gives  values  of  Z for  various  values 
of  r. 

Compute  the  computed  value  of  the  test  statistic, 
z (Computed)  according  to  the  following  formula, 
using  Z from  Step  13,  and  n from  Step  8. 

z (Computed)  = /n_3  *Z 

(l5)  — If  z (Computed) <z (Table)  for  positive  values  of  r, 
then  there  is  no  significant  correlation.  If 
z (Computed) >-z (Table)  for  negative  values  of  r,  there 
is  no  significant  negative  correlation.  If 
z (Computed) >z (Table)  for  positive  values  of  4,  there 
is  a positive  correlation,  and  if  z (Computed) <-z (Table) 
for  negative  values  of  r,  there  Is_  significant  nega- 
tive correlation. 


Notes : 


1 J Step  13  may  be  regarded  as  a test  of  the  hypothesis 

Hq : p=0  where  p is  the  value  of  the  true  but  unknown 
relationship  between  the  measurements.  The  altema 
tive  hypothesis  is  H^:p^0.  If  z (Computed)  lies 
between  -z (Table)  and  z (Table)  we  accept  Hq.  If 
z (Computed)  lies  outside  these  limits,  we  must 
reject  HQ. 


Any  significant  correlation  may  be  due  to  a third 
factor,  or  in  rare  cases  (a%  of  the  time),  chance. 
Take  this  into  consideration  when  making  decisions 
about  what  causes  what. 


d> 


3 ) Table  values 


of  the  z-statistic  are  given  here: 


a = 0.10 

0.05 

0.01 

a/2  = 0.05 

0.025 

0.005 

z (Table)  = 1.645 

1.96 

2.575 

(Scores 


- ,i 


EXAMPLE  for  Parametric  Test  for  Correlation  (Guidesheet  #10) 
Steps : 

© — This  is  an  example  of  how  a trend  analysis  officer 

might  test  the  relationship  between  proficiency  test 
scores  and  bombing  CEPs.  The  scores  and  CEPs  are 
arranged  as  follows.  Columns  3-6  are  filled  in 
later.  This  is  not  a test  for  trend. 


nc  Measurement 
4 (CEP) 

y; 


Square 

x2 

xi 

Square 

n 

8281 

4 

8372.25 

25 

8464 

49 

8649 

6.25 

8742.25 

36 

8836 

6.25 

9025 

25 

9216 

16 

9409 

36 

9506.25 

16 

9604 

4 

9801 

6.25 

Product 


457.5 
644 

232.5 
561 
235 
475 
384 
582 
390 
196 

247.5 


Ey.=  Zx2= 


1137.5 


48.5  107907.75  229.75  4586.5 


x,  (Borrbing  CEP) 


— In  order  to  avoid  very  large  numbers,  measurements 
y£  are  divided  by  100  become  our  y..  values , and  are 
recorded  in  Column  #3. 

■--Computing  each  square  of  Test  Scores  we  have: 
x?  = x^x^  or 

x£  =•  = 91*91  = 8281  Record  in  Column  #4 


x2  = x2  *x2  = 91-5'91-5  = 8372.25 


x12~x12*x12”  = 9801  " " 

) Computing  each  square  of  Bombing  CEPs  we  have: 

y|  - 7i‘7i  or: 

yl  = yl*yl  = - 4 Record  in  Column  #5. 


y2  = y2*y2  = 5*5  = 25 


It  If 


y12=y12 *y12=2 • 5 ’ 2 • 5 = 6.25  " " 

) — Computing  each  product  of  measurements  we  have: 

x±'y±  or 

x^*y^  = 91  • 2 = 182  Record  in  Column  #8 


•y2  = 91.5*5  = 457.5 


II  II  II 


x12*y12  = 99  *2,5  = 247'5  " " " 

) — Computing  the  sum  of  each  column  we  have: 

Exi  = x1+x2+* • *+x12  = 91+91.5+** *+99  * 1137.5 
Record  as  shown. 


BBsmsT^  ,"‘c«aaBpe- « 


2yi  = yl+y2+* * *+y12  = 2+5+** *+2.5  = 48.5 
Record  as  shown. 

2x?  = x2+x£+- • *+x£2  =8281+8372 . 25+» * -+9801=107905 . 75 
Record  as  shown. 

Ey£  = y|+y2+* * *+^12  “ 4+25+* • *+6. 25  = 229 . 75 
Record  as  shown. 

£V  yi  ' X1 ' y l+x2 ' y2+  ‘ ' +x12  'y12 

= 182+457 .5+* • *+247.5  = 4586.5 
Record  as  shown. 

(IT) There  are  12  measurements  of  each  kind  so  n=12. 

Computing  the  quantity  S according  to  Eq  (1) , we 

XX 

have: 

S*x  - » H - 

=12(107905. 75)-(1137.5)2 
=1,294,869  - 1,293,906.25 
sx::  = 962,75 

S will  be  used  in  Step  10. 

XX 

(IT) — Computing  the  quantities  S and  Sxy  according  to 
Eqs  (2)  and  (3)  we  have: 

Syy  = n^i  - (Zyi) 2 

=12(229. 75)-(48.5)2  = 2757-2352.25 


Syy  = 404,75 
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? 

z (Confuted)  z (Table) 

? 

- . 639--1.96 

Yes,  it  is_  greater.  We  conclude  that  there  is  no 
significant  correlation  at  this  a~. 05  level  of 
significance.  That  is,  on  the  basis  of  this  sample, 
we  cannot  say  that  high  proficiency  scores  are 
associated  with  low  CEPs . 

Here  is  an  example  of  Note  2,  page  109.  In  our 
example  we  should  not  say  high  scores  cause  low 
CEPs.  It  may  be  more  appropriate  to  say  that  high 
proficiency  (third  factor)  causes  both  high  pro- 
ficiency scores  and  low  CEPs.  But  on  the  other 
hand  the  third  factor  could  be  cheating,  which  also 
causes  high  scores  and  low  CEPs.  The  real  thing 
going  on  is  hard  to  measure.  Statistics  just  helps 
measure  the  strength  of  a relationship. 
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GUIDESHEET  #11,  The  Cox  and  Stuart  Test  for  Trend 


Assumptions : 


-This  method  is  used  when  the  assumptions  required  for 
Control-Chart  significance- testing  are  not  valid 
assumptions.  Also,  this  method  allows  quicker  signi- 
ficance-testing in  most  cases,  though  it  is  not  as 
powerful  a test. 


Steps : 


— Arrange  the  data  in  vertical  columns 
with  column  headings  as  follows. 


(a3  in  example) 


Period 

Period 

Measurement 

Name 

Number 

x. 

n ■ -j 

l 

Tally 

- 

-Count  the  total  number  of  observations  or  measure- 


ments that  are  to  be  tested  for  trend.  Call  this 


number  "N". 


3 J If  N is  odd,  discard  the  middle  value  by  drawing  a 

heavy  line  through  the  middle  period,  dividing  the 
data  into  two  equal  parts.  If  N is  even,  draw  the 
heavy  line  between  the  first  and  second  halves, 
dividing  the  data  into  equal  parts.  Sue  example. 


-Draw  a line  connecting  the  1st  measurement  (y-^)  in 
the  1st  half  to  the  1st  measurement  in  the  2nd  half. 
As  this  is  done,  note  whether  the  2nd  half  value 
is  greater.  If  it  is , mark  a tally  in  the  "tally" 
column  to  the  right  of  the  table. 


Continue  connecting  corresponding  measurements  and 
continue  tallying  each  time  the  second  half  value  is 
greater  than  the  first  half  value. 
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-Count  the  total  number  of  lines.  This  is  called  "n". 
It  should  be  half  the  number  of  measurements  after 
discarding. 


(jT)---Count  the  tally.  This  is  T (Computed) 

(IT) — Choose  a significance  level  a.  A significance  level 
of  a=.05  or  a=.10  is  usually  appropriate. 


-Form  the  null  hypothesis  KQ  and  alternative  hypoth- 
esis H-. . These  may  be  picked  from  the  three  choices 
below. 


(A) 

H : 
0 

Hr 

(B) 

V 

Hr 

(G) 

V 

Hr 

There  is  no  downward  trend. 

There  is  a downward  trend. 

There  is  no  upward  trend. 

There  is  a significant  upward  trend 
in  the  data 

There  is  no  significant  trend. 

There  is  either  an  upward  trend  or  a 
downward  trend  in  the  data. 


If  (A)  is  chosen,  continue  with  Step  10.  If  (B)  is 
chosen,  go  to  Step  15.  If  (C)  is  chosen,  go  to 
Step  20. 

(10)  — Enter  Table  II  on  page  123,  look  up  and  down  the 

column  of  numbers  associated  with  p=.50  and  the  n 
from  Step  6.  Find  the  4-place  decimal  most  nearly 
equal  to  the  chosen  value  of  a (Step  8) . 

(11) --Pick  the  value  directly  to  the  left  of  the  4-place 

decimal  found  in  Step  10  (in  the  column  labeled  "x") 
and  call  this  value  T(Table). 


12) --If  T(Computed)  from  Step  7 is  greater  than  or  equal 
to  T(Table),  accept  the  null  hypothesis  K : There 


is  no  downward  trend.  That  is,  if  T (Computed) >T (Tab le)  , 
accept  Hq. 
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(li) — If  T(Computed)  is  less  than  T(Tafcle),  reject:  the 

null  hypothesis  Hq  in  favor  of  the  alternative  hypoth- 
esis : There  is  a significant  downward  trend.  That 
is,  if  T (Computed) <T (Table) , accept  H^. 

(l4) --The  4-place  decimal  found  in  Step  10  is  the  actual 

significance  level  for  the  value  of  the  computed  test- 
statistic  for  this  data.  This  step  completes  the 
test  for  a downward  trend. 


©- 


15) --Enter  Table  II  on  page  123,  look  up  and  down  the 
column  of  numbers  associated  with  p=.50  and  the  n 
from  Step  6.  Find  the  4-place  decimal  most  nearly 
equal  (preferably  less  than)  the  chosen  value  of  a 
(Step  8) . 

-Pick  the  value  directly  to  the  left  of  the  4-place 
decimal  found  in  Step  15  (in  the  column  labeled  'V)  , 
and  subtract  this  number  from  n.  This  is  the  table 
value  of  the  test-statistic  T(Table). 

17)  --If  T (Computed)  from  Step  7 is  less  than  or  equal  to 
T(Table),  accept  the  null  hypothesis  Hq:  There  is  no 
upward  trend.  That  is,  if  T (Computed) -T (Table) , 
accept  Hq. 

18)  --If  T(Computed)  is  greater  than  T(Table),  reject  the 
null  hypothesis  H0  in  favor  of  the  alternative 
hypothesis  R-^:  There  is  a significant  upward  trend. 
That  is,  if  T (Computed) >T (Table) , accept  H^. 

19)  --The  4-place  decimal  found  in  Step  15  is  the  actual 
significance  level  for  the  value  of  the  computed 
test-statistic  and  this  data.  This  step  completes 
the  test  for  an  upward  trend. 


— Enter  Table  II  and  look  up  and  down  the  column  of 
numbers  associated  with  p=.50  and  the  n from  Step 


6 . 
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Find  the  4-place  decimal  most  nearly  equal  (prefer- 
ably less  than)  to  ot/2  where  u is  the  value  chosen 
in  Step  8. 

2l)  --Pick  the  value  directly  to  the  left  of  the  4-place 

decimal  found  in  Step  20  (in  the  column  labeled  "x") , 
and  call  this  value  T(Table),  the  table  value  of  the 
test-statistic. 


22)  — If  T (Computed)  from  Step  7 is  greater  than  or  equal 
to  T(Table)  and  less  than  or  equal  to  n minus  T(Table), 
accept  the  null  hypothesis  HQ:  There  is  no  signifi- 
cant trend.  That  is,  if  T (Table) ST (Computed) 5 
n-T(Table),  accept  the  null  hypothesis  Hq. 

23)  — If  T (Computed  is  less  than  T (Table)  or  greater  than 
n minus  T (Table),  reject  the  null  hypothesis  Hq  in 
favor  of  the  alternative  hypothesis  H^:  There  is 


either  an  upward  or  downward  trend.  That  is,  if 
T (Computed) <T (Table)  or  T (Computed) >n-T (Table) , 
reject  Hq  in  favor  of  H-^. 

(24) — The  4-place  decimal  (multiplied  by  2)  found  in  Step  20 
is  the  actual  significance  level  for  this  data  and 
this  test-statistic.  This  step  completes  the  test  for 
trend. 
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EXAMPLE  for  Cox  and  Stuart  Test  for  Trend  (Guidesheet  #11) 


Period 

Period 

Name 

Number 

i xi 

nil 


Sept 

Nov 

Dec 

Jan 

Feb 

Mar 

Apr 


U JL  . / ^ 

89. 9J 
90.1* 
90.8* 
90.5* 
91.4- 
89.9- 
91.1- 


lllllllll 


T (Computed) =// 


CD- 

©- 


©- 


©~ 

®- 


-We  wish  to  use  all  available  data  for  this  test, 
because  we  feel  it  is  all  relevant.  Therefore  N=15. 

-Since  N is  odd,  we  discard  the  middle  value,  yg. 

Draw  a heavy  line  through  period  #8  to  divide  the 
measurements  into  a first  half  and  a second  half,  as 
shown  above. 

-Draw  a line  connecting  the  1st  measurement  in  the 
first  half  to  the  1st  measurement  in  the  second  half 
We  note  2nd  half  value  is  not  greater,  but  the  same, 
so  we  do  not  tally. 

-As  we  continue  connecting  measurements,  we  find  only 
one  value  in  the  second  half  is  greater  than  the 
corresponding  value  in  the  first  half. 

-n=7 

-The  number  in  the  tally  is  2.  Therefore  T (Computed) 
=2,  our  computed  test-statistic. 


S 


I 


c 


1 


1 


f 

b 

i' 

i 


I - 


i 

I 


1 


li 

'! 

'i 

» 

i( 

if 


-We  choose  a significance  level  a=.05  for  this  test. 

-We  are  interested  in  whether  there  is  a significant 
downward  trend  or  not  and  do  not  care  to  tell  if  there 
is  an  upward  trend,  so  we  choose  (A). 


(lo)--We  go  to  Table  II,  starting  on  page  123,  and  look  at 
the  column  associated  with  p=.50  and  n=7.  We  look 
for  the  4-place  decimal  closest  to  a=.05.  We  find 
.0625  is  the  closest  in  the  column.  A portion  of  the 
table  is  shown  with  these  values  marked.  The  .0625 


now  becomes  the  exact  significance  level  for  this 


-• 

That 

is, 

a= . 

0625 

• 

— X- 

n 

X 

0.05 

0.10 

0.15 

0.20 

P 

0.25 

0.30 

0.35 

0.40 

0.45 

T 

0.50 

2 

2 

0.9025 

0.8100 

0.7225 

0.6400 

0.5625 

0.4900 

0.4225 

0.3600 

0.3025 

0.2500 

1 

0.9975 

0.9900 

0.9775 

0.9600 

0.9375 

0.9100 

0.8755 

0.8400 

0.7975 

0.7590 

3 

0 

0.8574 

0.7290 

0.6141 

0.5120 

0.4219 

0 3430 

0.2746 

0.2160 

0.1664 

0.1250 

1 

0.9928 

0.9720 

0.9392 

0.8960 

0.8438 

0.7840 

0.7182 

0.6480 

0.5748 

0.5000- 

2 

0.9999 

0.9990 

0.9966 

0.9920 

0.9844 

0.9730 

0.9571 

0.9360 

0.9089 

0.8750 

4 

0 

0.8145 

0.6561 

0.5220 

0.4096 

0.3164 

0.2401 

0.1785 

0.1296 

0.0915 

0.0625 

1 

0.9860 

0.9477 

0.8905 

0.8192 

0.7383 

0.6517 

0.5630 

0/.752 

0.3910 

0.3125 

2 

0.9995 

0.9963 

0.9820 

0.9728 

0.9492 

0.9163 

0.8735 

0.8208 

0.7585 

0.6875 

3 

I.GOOO 

0.9999 

0.9995 

0.9984 

0.9961 

0.9919 

0.98S0 

0.9744 

0.9590 

0.9375 

5 

0 ' 

0.7738 

0.5905 

0.4437 

0.3277 

0.2373 

0.1681 

0.1160 

0.0778 

0 0503 

0.0312 

1 

0.9774 

0.9185 

0.8352 

0,7393 

0.6328 

0.5282 

0.4284 

0.3370 

0.2562 

0.1875 

2 

0.9988 

0.9914 

0.9734 

0.9421 

0.8965 

0.8369 

0.7648 

0 6 826 

0.5931 

0.5000 

3 

1.0000 

0.9995 

0.0078 

0.9933 

0.9844 

0.9692 

0.9460 

0.9130 

0.8688 

0.8125 

4 

1.0000 

1.0000 

0.9999 

0.9997 

0.9950 

0.9976 

0.9947 

0.9898 

0.9815 

0.9688 

6 

0 

0.7351 

0.5314 

0.3771 

0.2621 

0.1780 

0.1176 

0.0754 

0.0467 

0.0277 

0.0156 

i 

0.9672 

0.8857 

0.7765 

0.6554 

' 0.5339 

0.4202 

0.3191 

0.2333 

0.1636 

0.1094 

2 

0.9978 

0.9842 

0 9527 

0.9011 

0.8306 

0 7443 

0.6471 

0.5443 

0.4415 

0.3438 

3 

0.9999 

0.9987 

0.9941 

0.9830 

0.9624 

0.9295 

0.3826 

0.8208 

0.7447 

0.6562 

4 

1.0000 

0.9999 

0.9996 

0.9984 

0.9954 

0.9891 

0.9777 

0.9590 

0.9308 

0.8906 

l 

3 

1.0000 

1.0000 

1.0000 

0.9999 

0.9998 

0.9993 

0.9982 

0.9959 

0.99)7 

0.9844 

■wro/ff  0.4783  0.3206  0.2097  0.1335  0 0824  0.0490  0.0280  OOTFt-vJJjlQlL 

0.9556  0.8503  0.7166  0.5767  0.4449  0.329  1 0.2338  0.1586  0.1024  Q.062SJ 

0.9962  0.9743  0.9262  0.8520  0.7564  0 6471  0.5323  0.4199  0.3164  0.2266 


11  — 


12)  — 


The  value  straight  across  the  table  from  .0625  in  the 
"x"  column  is  "1".  This  is  our  value  T(Table).  See 
the  portion  of  the  table  above.  We  have  T(Table)=l. 

T (Computed)  is  greater  than  T (Table).  That  is: 

T (Computed) >T (Table) 


2>1 

We  must  accept  the  null  hypothesis  H : There  is  no 


significant  trend  at  the  a=.0625  level  Our  tally 
(Step  7)  would  have  needed  to  be  zero  for  the  trend 
to  be  significant  at  this  a- level. 

(0)--This  step  io  not  applicable,  Step  12  was. 

(14)  — Again,  .0625  is  our  exact  level  of  significance. 

(15)  --Steps  15  through  19,  choice  (B) , the  test  for  upward 

trend,  goes  much  the  same  way.  Choice  (C)  a bit 
more  involved,  is  demonstrated  here. 

(20) --We  go  to  Table  II  and  look  for  the  4-place  decimal 
closest  to  a/2  or  .05/2=. 025.  That  is,  looking  in 
the  column  associated  with  p=.50  and  n=7,  we  find 
.0078  is  closest  to. 025. 

21) --We  find  the  value  in  the  x-column  opposite  to  .0078 
is  0.  Thus  our  test-statistic  is  T(Table)=0. 

--Our  T(Computed)=2  0,  but  must  check  to  see  if  it  is 
less  than  n-T(Table).  We  find  n-T(Table)=7-0=7  and 
T (Computed)  i£  less  than  this  value. 

T (Table) <T (Computed) <n-T (Table) 

0<2<7 

We  must  accept  Hq:  No  trend. 

--Not  applicable,  Step  22  satisfied  our  test. 

-Actual  exact  significance- level  here  is  2 (.0078) 
=.0156. 
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Table  XI 

The  Binomial  Distribution 

(Ref  4:477) 


P*(l-p)"'* 


p 


n 

X 

0.05 

0.10 

0.15 

0.20 

0.25 

0.30 

0.35 

0.40 

0.45 

0.50 

2 

2 

0.9325 

0.8100 

0.7225 

0 6400 

0.5625 

0.4900 

0.4225 

0.3600 

0.3025 

0.2500 

1 

O.S975 

0.9900 

0.9775 

0.9600 

0.9375 

0.9100 

0.8755 

0.8400 

0.7975 

0.7500 

2 

0 

0.8574 

0.7290 

0.6141 

0.5120 

0.4219 

01430 

0.2746 

0.2160 

0.1664 

0.1259 

1 

0.9928 

0.9720 

0.9392 

0.8960 

0.8438 

0.7840 

0.7182 

0.6480 

0.5748 

0.5000 

2 

0.9999 

0.9990 

0.9966 

0.9920 

0.9844 

0.9730 

0.9571 

0.9360 

0.9089 

0.8750 

4 

0 

0.8145 

0.6561 

0.5220 

0.4096 

0.3164 

0.2401 

0.1785 

0.1296 

0.0915 

0.0625 

1 

0.9860 

0.9477 

0.8905 

0.8192 

0.7383 

06517 

0.5630 

0.4752 

0.3910 

0.3125 

2 

0.9995 

0.9963 

0.9880 

0.9728 

0.9492 

0.9163 

0.8735 

0.8208 

0.7585 

0.6875 

3 

1.0090 

0.9999 

0.9995 

0.9984 

0.9961 

0.9919 

0.9850 

0.9744 

0.9590 

0.9375 

5 

0 

0.7738 

0,5905 

0.4437 

0.3277 

0.2373 

0.1681 

0.1160 

0.0778 

0 0503 

0.0312 

1 

0.9774 

0.9185 

0.8352 

0.7373 

0.6328 

0.5282 

0.4284 

0.3370 

0.2562 

0.1875 

2 

0.9988 

0.9914 

0.9734 

0.9421 

0.81*65 

0.8369 

0.7648 

0.6826 

0.5931 

0.5000 

3 

1.0900 

0.9995 

0.0078 

0.9933 

0.9844 

0.9692 

0.9460 

0.9130 

0 8688 

0.8125 

4 

1.0000 

1.0000 

0.9999 

0.9997 

0.9960 

0 9976 

0.9947 

0.9898 

0.9815 

0.9688 

6 

0 

0.7351 

0.5314 

0.3''71 

0.2621 

0.1780 

0.1176 

0.0754 

0.0467 

0.0277 

0.0156 

1 

0.9672 

0.8857 

0.7765 

0.6554 

' 0.5339 

0.4202 

0.3191 

0.2333 

■1.1636 

0.1094 

2 

0.9978 

0.9842 

0.9527 

0.901 1 

0.8306 

0.74  4 3 

0.6471 

0.5443 

0.4415 

0.3438 

3 

0.9999 

0.9987 

0.9941 

0.9830 

0.9624 

0.9295 

0.8826 

0.8208 

0.7447 

0.6562 

4 

1.0000 

0.9999 

0.9996 

0.9984 

0.9954 

0.9891 

0.9777 

0.9590 

0.9308 

0 8906 

3 

1.0000 

1.0000 

1.0000 

0.9999 

0.9998 

0.9993 

0.9982 

0.995? 

0.9917 

0.9844 

7 

0 

0.6983 

0 4783 

0.3206 

0.2097 

0.1335 

0 0824 

0.0490 

0.0280 

0.0152 

0.0078 

. 

0.9556 

0.8503 

0.7166 

0.5767 

0.4449 

0.3294 

0.2338 

0.1586 

0.1024 

0.0625 

2 

0.9962 

0.9743 

0.9262 

0.8520 

0.7564 

06471 

0.5323 

0.4199 

0.3164 

0.2266 

3 

0.9998 

0.9973 

0.9879 

0.9667 

0.9294 

0.8740 

0.8002 

0.7102 

0.6083 

0.5000 

4 

1.0000 

0.9998 

0.9988 

0.9953 

0.9871 

0.9712 

0.9444 

0.9037 

0.8471 

0.7734 

5 

1.0000 

1.0C00 

0.9959 

0.9996 

0.9987 

0.9962 

0.9910 

0.9812 

0.9643 

0.9375 

6 

1.0000 

1.0000 

1.0000 

1.0000 

0.9999 

0.9998 

0.9994 

0.9984 

0.9963 

0.9922 

8 

0 

0.6634 

0.4305 

0.2725 

0.1478 

0.1001 

0.0576 

0.0319 

0 0168 

0.0084 

0.0039 

1 

0.9428 

0.8131 

0 6572 

0 5033 

0.3671 

0.2553 

0.169' 

0.1064 

0.0632 

0.0352 

2 

0.9942 

0.9619 

0.8948 

0.7969 

0.6785 

0.5518 

0.4278 

0.3154 
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0 9996 
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0.8939 
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0.9988 
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6 
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0.9996 
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0.9648 

7 
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1 0000 

1.0000 

1.0000 
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0.9998 

0.9993 

0.9983 

0.9961 

9 

0 
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0.3874 

0.2316 

0.1342 

0.0751 

0.0404 

0.0237 

0.0101 

0.0046 

0.0020 

1 

0.9288 

0.7748 

0.5995 
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0.3003 

0.1960 

0.1211 

0.0705 

0.0385 

0.0195 

2 

0.9916 

0.9470 

0.8591 
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0.6C07 

0.4628 

0.3373 

0.2318 

0.1495 

0.0898 

3 

0.9994 

0.9917 

0.9661 

0.9144 

0.8343 

0.7291 

06089 

0.4826 

0.3614 

0.2539 

4 

1.0000 

0.9991 

0.9944 

0.9804 
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0.9012 

0.8283 

0.7334 

0.6214 

0.5000 

5 

1.0000 

0.99°9 

0.1994 

0.9969 

0.9900 

0.9747 

0.9464 

0.9006 

0.8342 

0.7461 

6 

1.0000 

1.0000 

1.0000 

0.9997 

0.9987 

0.9957 

0.9888 

0.9750 

0.9502 

0.9102 

7 

1.0000 

1 0000 

1.0000 

1.0000 

0.9999 

0.9996 

0.9986 

0.9962 

09909 

0.9805 

8 

1.0000 

J-'OoO 

1.0000 

1,0000 

1.0000 

1.0000 

0.9999 

0.9997 

0.9992 

0.9980 

0.05 

0.10 

0.15 
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0.30 
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0.40 
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0.50 
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0.3487 

0.1969 

0.1074 

0.0563 
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0.0135 

C.0060 

0.0025 

0.0010 

0.9139 

0.7361 

0.5443 

0 3758 

0.2440 

0.1493 

0.0860 

0.0464 

0.0232 

0.0107 

0.9885 

0.9298 

0.8202 

0.6773 

0.5256 

0.3828 

0.2616 

01673 

0 0996 

0.0547 

0.9990 

0.9872 

0.9500 

08291 

0.7759 

0 6496 

0.5138 

0.3823 

0.2660 

0.1719 

0.9999 

0.9984 

0.9901 

0 9672 

09219 

0 8497 

0.7515 

06331 

0.5044 

0.3770 

1.0000 

0.9999 

0.9986 

0.9936 

0.9803 

0.9527 

° 9051 

08338 

0.7384 

0.6230 
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1.0000 

0 9999 

0.9991 

0 9965 

0.9894 

0.9740 

0.9452 
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0.8281 

1.0000 

1.0000 

1.0000 

0.9999 

0.9996 

0.9984 

0.9952 

0 9877 

0.9726 

0 9453 

1 0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.9999 

0.9995 

0.9983 

0.9955 

0.9893 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.5999 

0.9997 

0.9990 

05638 

0.3138 

0.1673 

00859 

0.0422 

00198 

0.0088 

0 0036 

0 0014 

0.0005 

0.8981 

0 6974 

0.4922 

0.4221 

0.1971 

0 1130 

0.0606 

0.0302 

0 0139 

0 0059 

0.9848 

0.9104 

0.7788 

C.6174 

0.4552 

0.3127 

0.2001 

0.1189 

0.0652 

0 0327 

0.9984 

0.9815 

0.9306 

0.8389 

0.7133 

05696 

0.4256 

0.2963 

0.1911 

0.1133 

0.999° 

0.9972 

0.9841 

0.9496 

0.88S4 

0.7897 

0.6683 

0.5328 

0.3971 

0.2744 

1.0000 

0.9997 

0.9973 

0.9883 

0.9657 

0.9218 

0.8513 

0.7535 

0.6331 

0.5000 

1.0000 

'..0000 

0.°997 

0.9980 

0.9974 

0.9784 

0.949° 

0.90C6 

0.8262 
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1.0000 

0.9998 

0.9988 

0.9957 

0.9878 

0 9707 

0.9390 

0.8867 

1.0000 

1.0000 

1.0000 

1.0000 

0.9999 

0.9794 

0.9980 

0.9941 

0.9852 

0.9673 
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1.0000 

1.0000 

1.0000 

1.0000 
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0.9998 

0.9993 

0.9978 

0.9941 
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1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 
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0.9998 

0.9995 

0.5404 

0->824 

0.1422 

00687 

0.0317 
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0.0057 

0.0022 

0.0005 

0.0002 
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0.6590 

0.4435 

0.2749 

0.1584 
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0.0424 

0.0196 
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0.0032 

0.9804 
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0.7358 
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0.3902 

0.2528 

0.1513 

0 0834 

0.0421 

0 0193 

0.9978 

0.9744 

0.9078 

0.7946 
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04925 

0.3467 

0.2253 

0.1345 

0 0730 

0.9998 

0.9957 

0.9761 

0.9274 
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0.7237 

0.5833 

0.4382 

0.3044 

0.1938 

1,0000 

0.9995 

O.C954 

0.9806 

0.9456 
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0.7873 
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0.5269 

0.3872 

1.0000 

0.9999 

0.9993 

0 9961 

0.98S7 

0.9614 

0.9154 

0.8418 

0.7393 

06123 

1.0000 

1.0000 

0 9999 

0.9994 

0.9972 

0.9905 

0.9745 

0.9427 

0 8883 

0.8062 
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1.0000 

0.9999 

0.9996 

0.9983 
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1.0000 

1.0000 

1.0000 

0.9993 

0i992 

0.9972 
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0.9999 

0.9997 

0 9989 
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1.0000 
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0.9999 
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0.0097 
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0.0013 
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0.0001 
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0.3983 
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0 0049 
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0 8661 
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05017 
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0.9969 
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00979 
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0.9997 

0.9935 

0.9653 
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0.7940 

06543 

0.1005 
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0.1334 

1.0000 

0.9991 

0,9925 

0.9700 

0.9198 

0.8346 

0.7159 

0.5744 

0.4268 

0.2905 

1.0000 

0.9999 

0,9987 

0.9930 

0.9757 

0.9376 

0.3705 

0.7712 

0 6437 

0.5000 

1.0000 

1.0000 

0.9993 

0.9988 

0.9944 

0.9818 

0.9533 

0.9023 
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0.7095 

1.0000 
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0.9990 

0.9960 

0.9874 
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1.0000 

0.9999 

0.9993 

0.9975 
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0.9539 
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0.9987 

0.9959 

0.9888 
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0.9999 
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0.9983 
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0 4877 

0.2288 
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6 

1.0000 

0.9998 

0.9978 
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0.6925 
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7 

1.0000 

1.0000 

0.9997 

0.9976 

0.9897 

0.9685 
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0.8499 

0.7414 
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1 0000 

1.0000 

1.0000 

0.9996 

0.9978 

0.9917 

0.9757 

0.9417 
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9 
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1 0000 

1.0000 

1.0000 

0.9097 

0.9983 

0.9940 

0.9825 
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1.0000 

1.0000 
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0.9989 
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0.9961 
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1.0000 
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1.0000 

1.0000 

1.0000 

1.0000 
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0,9997 
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1.0000 
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0.9999 

! 13 

0 
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0 829^ 
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0.0353 
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0 0052 
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0.0271 
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3 

0.9945 

0.9444 
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0.4613 

0.2969 
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0.0905 

0.0424 
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4 

0.9994 

0.5873 

0.9383 

0.8358 

0.6865 
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0.3519 

0.2173 

0.1204 

0.0592 

5 

0.9999 

0.9978 
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' 0.4032 
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0.1509 

c 
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0.9434 
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0.9958 
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8 
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0.9999 

0.9992 

0.9958 
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0.9578 
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0.6964 

1 

9 
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0.9999 

0.9992 

0.9963 

0.9876 

0.9662 
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10 
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1.0000 

1.0006 

0.9999 

0.9993 
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0.9907 

0.974  b 

0.9408 

11 
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1.0000 

1.0000 

1.0000 

1.0000 

0.9999 

0.9995 

0.9981 
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12 
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0.9989 
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13 
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1.0000 
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0.9999 

0.9995 

14 
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t.odoo 

1 
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0.0033 
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i 

l 
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0.5147 

0.2839 
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0.0033 

0.0010 

C.0003 

j 

2 

0.9571 

0.7892 

0.5614 

0.3518 

0.1971 
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0.0451 
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0.0066 

0.0021 

3 

0 9930 

0.9316 

0.7y>9 

0.5981 

0.4050 

0,2459 

0.1339 
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0 0281 

0.0106 

i 

4 

0.9991 

0.9830 

0.9209 

0.7982 

0.6302 

0.4499 

0.7892 

0.1666 

0.0853 
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i 

0.9999 

0.9967 

0.9765 

0.9183 

08103 

0 6598 

0 4900 

0.3288 

0.1976 

0.1051 

6 

1.0000 

0.9995 

0.9944 

0.9733 
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C.8247 

0 6881 

0.5272 

0 3660 
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7 

1.0000 

0.9999 

0.9989 

0.9930 

0.9729 

0,7256 

0.8406 

0.7161 
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0.4018 
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8 

1 0000 
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0.9998 
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1 

9 

1 0000 
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0.9514 
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0.9894 
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13 
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1.0000 
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0.9999 

0.9994 
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14 
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1.0000 
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1.0000 
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1 .0090 
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13 
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1.0000 
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1 0000 
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0 
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0.1668 
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0 oono 
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1 
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0.48’  8 

0.2525 

0.1182 
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0,0021 

0 0006 
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| 

2 
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0.7b  <8 
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0.3096 

0.1637 
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00123 
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0 0012 

> 

3 
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0.7536 

0.5489 

0.353C 

0.2019 
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0.0464 

0.0184 

0 0063 

1 

4 
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0.1260 
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Table  II  (Continued) 


X 

0.05 

0.10 

0.15 

0.20 

p 

0.25 

0.30 

0.35 

040 

045 

0.50 

5 

0.9999 

0.9953 

0.9681 

08943 

0.7653 

0.5968 

0.4197 

0.2639 

0.1471 

0.0717 

6 

1.0000 

0.9992 

0.9917 

0.9623 

0.8929 

0.7752 

0 6138 

0.4478 

0.2902 

0.1662 

7 

1.0000 

0.9999 

0.9983 

0.9891 

0.9598 

0.8954 

0.7872 

0.6405 

0.4743 

0.3145 

8 

1.0000 

1.00OO 

0.9997 

0.9974 

0.9876 

0.5S97 

0.9006 

0.8011 
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0.5000 

9 

1.0000 

1.0000 

1.0000 

0.9995 

0.9965 

0.9873 

0.9617 

0.9081 

08166 

0 685S 

IP 

1 0000 

1.0000 

1.0000 

0.9999 

0.9994 

09968 

0.9880 

0.9652 

0.9174 

0.8338 

11 

1.0060 

1.0000 

1.0000 

1 0000 

0.9999 

0.9995 

0.9970 

0.9894 

0.9699 

0.9233 

12 

1.0000 

1.0000 

1.0000 

1.0000 

1.0003 

0.9999 

0.9994 

0.9975 

0.95)4 

0.9755 

13 

1.0000 

1.0000 

1.0000 

' .0000 

1.0000 

1.0000 

0.9999 

0.9995 

0.9981 

0.9936 

14 

1.0000 

1.0000 

, 1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.9999 

0.9997 

0.9988 

13 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

l.OOCO 

1,0000 

0.9999 

16 

1.0000 

1.0000 

1.0000 

1.0U00 

1.0000 

1.0000 

1 0000 

1.0000 

1.0000 

1.0000 

0 

P.3972 

0.151 1 

0.0536 

0.0180 

0.0056 

0.0016 

0.0004 

O.OCO. 

0.0000 

0.0000 

i 

0.7735 

0.45  C 3 

0.2241 

0 0991 

0 0395 

0.0142 

0.0046 

0.0013 

0.0003 

0.0001 

2 

0.9419 

0.7378 

0.4797 

0.2713 

0.1353 

0.060C 

0.0236 

0.0032 

0 002.* 

0.0007 

3 

0.9891 

0.9018 

0.7202 

0*010 

0.3057 

0.1646 

0.0783 

0.0228 

0.0120 

0.0038 

4 

0.9935 

0.9718 

0.8794 

0.7164 

0.5187 

0.3327 

0.1886 

0.0942 

0.0411 

0.0154 

3 

0.9998 

0.9936 

0.9581 

0.8671 

0.2175 

0.5344 

0.3553 

0.2088 

0.1077 

0.0481 

6 

1.0000 

0.9988 

0.9882 

0.9487 

0.8610 

0.7217 

0.5491 

0.3743 

0.2258 

0.1189 

7 

1.0000 

0.9998 

0.9973 

0.9837 

0.9431 

0.8593 

0.7283 

0.5634 

0.3915 

0.2403 

8 

1.0000 

1.0000 

0.9995 

0.9957 

0.9807 

C.9404 

0.8609 

0.7368 

0.S778 

0.4073 

9 

1.0000 

1.0000 

0.9999 

0.9991 

0.9946 

0.9790 

P 9403 

0.86S3 

0.7473 

0 5927 

to 

1.0000 

1.0000 

1.0000 

0.9998 

0.9988 

0.9939 

0.9788 

0.9424 

0.8720 

0.7597 

ll 

1.0000 

1.0000 

1.0000 

1.0000 

0.9928 

0 9986 

0.9938 

0.9797 

0.9463 

0.  oil 

12 

1.0000 

1.C'00 

1.0000 

1.0000 

1.0000 

0.9997 

0.998b 

0.5)42 

0.9817 

0.9519 

13 

1.COOO 

1,0000 

1.0000 

1.0000 

1 0000 

1.0000 

0.9997 

0.9987 

0.9951 

0.9846 

14 

1.0000 

1.0000 

1.0000 

1.0060 

1.0000 

1.0000 

1.0000 

0.9998 

0.9990 

0.9962 

15 

1.0000 

1"<100 

1.0000 

l/'OOO 

1.0000 

1.0000 

1.0000 

1.0000 

0.9999 

0.9993 

16 

1.0000 

1.0000 

1.0000 

|O0nn 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.9999 

0 

0.3774 

0.1351 

0.0456 

0.0144 

0 0042 

0.0011 

0.0003 

0 0001 

0 0000 

06)100 

1 

0.7547 

0.4203 

0.1985 

0.0829 

0 0310 

0.0104 

0.0)3’ 

0.0008 

0.0002 

0.0000 

2 

0.9335 

0.7053 

0.4413 

0.2369 

0.1113 

0.0402 

0.0  C 

0 0055 

0.00)6 

0.0004 

3 

0.9868 

0.8850 

0.6S41 

0.4551 

0.2630 

0.1332 

0 6591 

0 0230 

06,177 

0.0022 

4 

0.9980 

0.9648 

0.8556 

0 6733 

0.4654 

0.2822 

0.1500 

0 0696 

rt  0280 

0.0096 

5 

0 9998 

0.95!  4 

0.9463 

08  369 

0.6678 

0473' 

^'>'>68 

0 1629 

0.077/ 

0.0318 

6 

1.0000 

0.9983 

0.9837 

0.9324 

08251 

0.6f  j5 

0.4812 

0.3081 

0.1727 

0.083., 

7 

1.0000 

0.9997 

0.9959 

3.9767 

0.9225 

0.5180 

0 6656 

0.4878 

0.3)69 

0 *,  m 

8 

1 9000 

1.0000 

0.9992 

0 9933 

0.9713 

0.9161 

0 8145 

0 6675 

0 4940 

0.3238 

9 

1.0000 

1.0000 

0.9999 

0.2984 

0.9911 

0.9674 

0.9125 

0.8139 

05710 

0.5000 

10 

1.0000 

l.OOCff 

1,0000 

0.9997 

0.9977 

0.9895 

0.9653 

0.9115 

0.8159 

0 6762 

11 

1.0000 

1.0000 

1.0000 

12)000 

0.999.1 

0 9972 

0.9886 

0.9648 

0.9129 

0.8204 

12 

1.0000 

1.0000 

l.oooo 

1 OOOO 

0.9999 

0.9994 

0.9969 

0.9884 

0.9658 

0.9165 

13 

l ocoo 

1. 0000 

1.0600 

,.C00 

1.0000 

C.99/9 

0.99°'; 

0.9966 

0 9391 

0 9682 

14 

1.9000 

1 0000 

1.0000 

1.0100 

1.0000 

1.0000 

0 6999 

0.9994 

0.9972 

0.9904 

15 

1.0000 

1.0C0C 

l.oooo 

1.0000 

1.0000 

1.0000 

1.0000 

0.9999 

0.9995 

0.9978 

16 

!~vu<> 

1.000C 

\000O 

! OSO 

1.0000 

1. 0,i00 

1.0060 

1.0000 

0.9999 

0.9996 

17 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0 

0.3585 

0.1216 

0.0388 

) 0115 

0 1032 

0 0003 

0.0002 

o oooo 

0.0000 

0 0000 

1 

0.7358 

0.3917 

0.1756 

0.0692 

0.0213 

0 0076 

On,)2l 

00005 

0.000 1 

0.0000 

2 

0.9245 

0.6769 

0.4 '19 

0.206' 

0°13 

0.0355 

00121 

0 0036 

0..'00° 

03)002 

3 

0.9841 

0.8670 

0 6477 

0.4114 

2252 

0.1071 

00444 

0.0160 

0.0049 

70013 

4 

0.9974 

0.9568 

0 8298 

0 6296 

0.4148 

0.2375 

0.1182 

0,0510 

OOlSy 

0 0059 

5 

0.9997 

0 7887 

0.9327 

08042 

0 6172 

0.4164 

0.29*4 

0.1256 

0.0553 

0.020/ 

6 

1 .0000 

0 9976 

0.9781 

0.9123 

0.7858 

0.6080 

0.4 16« 

0 2500 

0.1299 

0 0577 

7 

'1.0000 

199% 

0.994, 

0.9679 

0.8982 

0.7723 

P.6010 

0.4159 

0.2520 

6.1316 

8 

' 1.0000 

'-.99°9 

0.9987 

0.9900 

0.9591 

0 8867 

0.7624 

0.5956 

0.4143 

0 2537 

9 

1.0000 

1.0  7w0 

0.9998 

0.9974 

0.986! 

0.9520 

0 8782 

0.7553 

0.5914 

0.4119 

10 

1.0000 

1.0000 

1 0090 

0.9994 

0.9961 

0.9829 

0.946' 

0 8725 

0.7507 

0.5881 

11 

1 0000 

1 0000 

1.0000 

0.9999 

0.9991 

0.9949 

0.9804 

0.9475 

0.8692 

0,7483 

12 

1.0000 

1.0000 

1.0000 

1.0000 

0 9998 

0.9987 

0.5710 

0.9790 

0 5420 

0.8684 

13 

1.0000 

1.0000 

121000 

l.oooo 

1.0000 

0.9997 

0,9985 

0.0935 

0.9  ’86 

0 9423 

)4 

1.0000 

12)000 

1.0000 

1,0000 

1.0000 

1.0000 

0.9967 

0.9984 

0.9936 

0.9793 

15 

1.0000 

1 0000 

1.0000 

1.0060 

'.0000 

1.0000 

1.00-60 

0,9997 

0.9985 

0.9941 

(6 

1.COOO 

1.0000 

1.0000 

1.0000 

1.0000 

1.0,100 

>.0000 

1.0000 

0.9997 

0.9987 

17 

1.0000 

1.0000 

1.0000 

l.oooo 

1.0000 

1.0000 

l.oooo 

1.0000 

1.0000 

0,9998 

IS 

1.0000 

i.oono 

1.0000 

l.oooo 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 
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GUIDESHEET  #12.  Kendall's  Tan  Test  for  Trend. 

Assumptions : 

©•* — This  method  is  used  when  the  assumptions  required  for 
Control-Chart  significance  testing  are  not  valid 
assumptions.  Also,  this  method  allows  quicker  signi- 
ficance-testing in  most  cases,  though  it  is  not  as 
powerful  a test. 

Steps : 

© Arrange  the  data  in  vertical  columns  (as  in  example) 

with  column  headings  as  follows.  Columns  4,  5,  and 
6 will  be  filled  in  later. 


Period 

Period 

Measurement 

Rank 

Ranks 

Ranks 

Name 

Number 

of 

Greater 

Less 

xi 

*i 

?i 

G 

L 

■-Count  the  total  number  of  observations  or  measure- 
ments that  are  to  be  tested  for  trend.  Call  this 
number  "n". 

•-Rank  the  measurements  by  size.  That  is,  the  smallest 
number  in  column  3 will  receive  the  rank  of  "1"  and 
the  largest  the  rank  of  "n"  (if  no  ties) , whatever 
"n"  was  in  Step  2.  Record  each  rank  in  column  4, 

Rank  of  y^.  If  there  are  tied  measurements,  both 
(all)  of  them  should  be  given  tht  average  of  the  two 
(all)  successive  ranks  they  would  have  received  if 
one  were  bigger  than  the  other. 

•-Consider  the  top  entry  in  column  4,  the  rank  of  y^. 
Count  the  number  of  ranks  which  exceed  it.  Record 
this  number  at  the  top  of  the  "G"  column,  column  5. 

--Consider  the  next  entry  in  column  4,  the  rank  of  y.7. 
Count  the  number  of  remaining  ranks  in  column  4 
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which  exceed  it.  That  is,  the  rank  of  y^  is  not  to 
be  considered  again.  Record  this  number  in  the 
appropriate  place  in  column  5. 


-Continue  counting  exceeding  ranks  for  remaining 
measurements,  filling  column  5. 


-Compute  the  sum  of  the  values  in  column 
record  this  sum  below  column  5. 

-Again,  consider  the  top  entry  in  column 
of  y-^.  Count  the  number  of  ranks  which 
than  it.  Record  this  number  at  the  top 
column,  column  6. 


5 (EG)  and 

4 , the  rank 
are  less 
of  the  "L" 


-Consider  the  next  entry  in  column  4 (rank  of  • 
Count  the  remaining  ranks  which  are  smaller  and 
record  the  number  in  column  6. 


-Continue  for  the  remaining  entries  of  column  4, 
filling  column  6. 


11)  — Compute  the  Siam  of  the  values  in  column  6 (EL)  and 
record  this  sum  below  column  6. 


--Compute  the  value  of  the  test- statistic,  T (Computed). 


©- 


T (Computed)  = £g  - £L 

-Choose  a significance  level  a.  A significance  level 
of  a=.05  or  a=.10  is  usually  appropriate. 

-Form  the  null  hypothesis  RQ  and  alternative  hypoth- 
esis H-^.  These  may  be  picked  from  the  three  choices 
below. 


(A) 

H : 
o 

There  is 

no  downward  trend. 

Hr 

There  is 

a downward  trend  in  the  data. 

(B) 

H : 

0 

There  is 

no  upward  trend. 

V 

There  is 

an  upward  trend  in  the  data. 

(G) 

H : 
o 

No  trend 

exists . 

Hr 

There  is 
downward 

either  an  upward  trend  of  a 
trend  in  the  data. 
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If  (A)  is  chosen,  continue  with  Step  15.  If  (B)  is 
chosen,  go  to  Step  17.  If  (C)  is  chosen,  go  to 
Step  19 . 

Enter  Table  III  with  p=l-a  (Step  13)  and  the  value 
of  n (Step  2)  to  find  the  numerical  value  of  the 
test-statistic.  When  testing  for  a downward  trend, 

T (Table)  equals  the  negative  of  the  value  found  in 
the  p=l-a  column  across  from  the  proper  value  of  n. 

If  T (Computed)  from  Step  12  is  greater  than  or  equal 
to  T(Table),  T (Computed) £T (Table) , accept  the  null 
hypothesis  HQ  of  no  downward  trend.  If  T (Computed) 
<T (Table),  reject  the  null  hypothesis  in  favor  of 
the  alternative  hypothesis  H^:  There  is  a downward 
trend  in  the  data.  This  step  completes  the  test  for 
a downward  trend. 


Enter  Table  III  with  p=l-a  (Step  13)  and  the  value 
of  n (Step  2)  to  find  the  table  value  of  the  test- 
statistic  T(Table).  That  is,  T(Table)  is  found  in 
the  p=l-a  column  across  from  the  proper  value  of  n. 

If  T (Computed)  from  Step  12  is  less  than  or  equal  to 
T(Table),  T(Computed) 5T (Table) , accept  the  null 
hypothesis  HQ  of  no  upward  trend.  If  T (Computed) 

>T (Table),  reject  the  null  hypothesis  in  favor  of 
the  alternative  hypothesis  H^:  There  is  an  upward 
trend  in  the  data.  This  step  completes  the  test  for 
an  upward  trend. 


-Enter  Table  III  with  p=l-a/2  (a  from  Step  13)  and  the 
value  of  n (Step  2)  to  find  the  table  value  of  the 
test-statistic  T(Table).  That  is,  T(Table)  is  found 
in  the  p=l-a/2  column  across  from  the  proper  value 
of  n. 
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(20)  — If  T(Computed)  from  Step  12  is  less  than  the  nega- 

tive of  T(Table)  or  greater  than  T (Table),  reject 
the  null  hypothesis  Ho  in  favor  of  the  alternative 
hypothesis  H^:  There  is  either  a downward  trend  or 
an  upward  trend  in  the  data.  That  is,  if  T (Computed) 
<-T(Table)  or  T (Computed) >T (Tab le) , reject  HQ  in 
favor  of  H^. 

(21)  — If  T(Computed)  from  Step  12  is  greater  than  or  equal 

to  the  negative  of  T (Table)  and  less  than  or  equal  to 
T(Table),  accept  the  null  hypothesis  Hq  of  no  trend. 
That  is,  if  -T (Table) -T (Computed) -T (Table) , accept  Hq. 
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EXAMPLE  for  Kendall’s  Tau  Test  for  Trend  (Guidesheet  #12) 


U 


r • 


Steps : 

@ — The  data  is  arranged  as  follows: 


j 

{ 

\ 

1 

Period 

Name 

Period 

Number 

xi 

Measurement 

?i 

— 

Rank 
of  y± 

Ranks 

Greater 

G 

Ranks 

Less 

L 

;!  , 
i 

Jan 

1 

89.9 

2 

12 

0 

Feb 

2 

90.5 

5.5 

9 

3 

Mar 

3 

92.2 

15 

0 

12 

I* 

J' 

Apr 

4 

91.2 

11 

3 

8 

1 

May 

5 

90.7 

7.5 

5 

A 

! 

Jun 

6 

91.3 

12 

2 

7 

i 

Jul 

7 

90.7 

7.5 

4 

4 

j 

Sep 

8 

91.7 

14 

0 

7 

f 

Oct 

9 

89.9 

O 

4. 

5 

0 

ji 

Nov 

10 

90.1 

4 

4 

1 

Ij 

Dec 

11 

90.8 

9 

2 

2 

1 

Jan 

12 

90.5 

5.5 

2 

1 

ij 

Feb 

13 

91.4 

13 

0 

2 

1 

Mar 

14 

89.9 

2 

1 

0 

Apr 

15 

91.1 

10 

0 

0 

©- 

©■ 


©- 


©- 


EG=49  EL-51 

-Use  all  the  data,  giving  us  n=l5. 

-The  value  89.9  is  the  smallest  but  this  value  appears 

3 times  in  data.  If  these  were  not  tied,  but  just 

barely  unequal,  their  ranks  would  be  1,  2,  3.  The 

average  of  these  is  1+2+3-2.  The  rank  of  "2” 

3 

is  assigned  to  all  three.  The  value  90.1  is  ranked 
"4",  both  90.5' s ranked  5.5. 

Consider  the  rank  of  y^,  the  number  "2".  The  number 
2 is  exceeded  by  a total  of  12  numbers  underneath  it 
in  the  column.  Record  ”12”  in  column  #5. 

Consider  the  next  rank,  the  rank  of  y£  which  is  "5.5". 
Nine  (9)  numbers  underneath  "5.5"  exceed  the  number 
"5.5".  Record  "9".  Next,  no  ranks  exceed  15,  record 
"0".  Next,  3 ranks  eyceed  8 (remember  only  ranks 
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underneath  are  considered)  so  record  "3n. 

•-Continue  as  in  Step  5,  filling  column  #5  as  shown. 

•-If  we  add  up  column  #5  we  get  EG=49.  Record  this  in 
the  space  designated  above. 

-Consider  again  the  rank  of  y^,  the  number  "2".  There 
are  zero  ranks  underneath  the  number  2 which  are  less. 
Record  "O'1  in  column  #6. 

Consider  the  rank  of  y£,  the  number  5.5.  There  are 

3 smaller  ranks  underneath  the  value  5.5  in  column  #4. 
Record  "3"  in  column  #6. 

10) --Consider  the  remaining  ranks  of  column  #4.  For  rank 
"12"  we  get  "12",  for  rank  "8"  we  get  "8",  for  rank 
"5.5"  we  get  "4"  and  so  on,  filling  column  #6  as 
shown . 

•-Add  up  the  numbers  in  column  #6.  We  get  EI.=51  and 
record  below  column  #6. 

—T (Computed)  = £G  - £L  = 49-51  = -2 

T (Computed)  - -2 


B 

12) 


© 

© 


— We  choose  a=.05 

— We  are  mostly  interested  in  the 
presence  of  a downward  trend,  so 
we  choose  (A) . 


15  - 


-Go  to  Table  III,  page  134. 
p=l-a=l-.05=.950  and  n=15. 

Go  down  the  p=. 950  column  and 
across  from  n=15  we  find  "33". 
The  negative  of  this  is  our 
Table  value,  so 

T (Table)  = -33 


16J --The  value  -2  is  greater  than  -33  (less  negative  means 
greater)  so  we  accept  Hq:  No  significant  downward 


n 

p = .900 

.950 

4 

4 

4 

5 

6 

6 

6 

7 

9 

7 

9 

11 

8 

10 

14 

9 

12 

16 

10 

15 

19 

11 

17 

21 

12 

18 

■ 24 

13 

22 

26 

14 

23 

31 

15 

27 

33. 

4 / 

1C 
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trend  at  the  .05  significance  level.  That  is 
-2>-33,  accept  Ho« 


The  choice  (B)  for  test  of  upward  trend  is  much  the  same. 
Let's  say  we  chose  (C) . 

Go  to  Table  III,  page  134.  p=l-  /2=1- .05/2=1-. 025 
=.975.  Look  under  the  p=.975  column  and  across 
from  n=15  and  find  "39".  The  value. 

(20)  — T (Computed) =- 2 is  neither  less  than  the  negative  of 

T(Table)=-39  nor  greater  than  T(Table)=39,  so  we 
cannot  reject  HQ.  Go  to  Step  21. 

(21)  — We  find  -39-~2^39.  That  is,  -T (Table) ^T (Computed) 

^T (Table)  so  we  accept  the  null  hypothesis  Hq:  No 
significant  trent  exists  at  the  .05  level  of 
significance. 
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Table  III 

Quantiles  of  the  Kendall  Test  Statistic 

(Ref  1:391,392) 


n 

p = .9(50 

.950 

.9?.i 

.990 

.995 

4 

4 

4 

6 

6 

6 

5 

6 

6 

8 

8 

10 

6 

7 

9 

11 

11 

13 

7 

9 

11 

13 

15 

17 

8 

10 

14 

16 

18 

20 

9 

12 

16 

18 

22 

24 

10 

15 

19 

21 

25 

27 

11 

17 

21 

—> 

29 

31 

12 

18 

■ 24 

28 

34 

36 

13 

22 

26 

32 

38 

42 

14 

23 

31 

35 

41 

45 

15 

27 

33. 

39 

47 

51 

16 

28 

36 

44 

50 

56 

17 

32 

40 

48 

56 

62 

18 

35 

43 

51 

61 

67 

19 

37 

47 

55 

65 

73 

20 

40 

50 

60 

70 

78 

21 

42 

54 

64 

76 

84 

22 

45 

59 

69 

81 

89 

23 

49 

63 

73 

87 

97 

24 

52 

66 

78 

92 

102 

25 

56 

70 

84 

98 

108 

26 

59 

75 

89 

105 

115 

27 

61 

79 

93 

111 

123 

28 

66 

84 

98 

116 

128 

29 

68 

88 

104 

124 

136 

30 

73 

93 

109 

129 

143 

31 

75 

97 

115 

135 

149 

32 

80 

102 

120 

142 

158 

33 

84 

106 

126 

150 

164 

34 

87 

111 

131 

155 

173 

35 

91 

115 

137 

163 

179 

36 

94 

120 

144 

170 

188 

37 

98 

r26 

150 

176 

196 

38 

103 

131 

155 

183 

203 

39 

107 

137 

161 

191 

211 

40 

110 

142 

168 

198 

220 

134 


GUXDESHEET  #12.  Spearman's  Test  for  Trend 


Assumptions : 

@ — This  method  is  used  when  the  assumptions  required  for 
Control-Chart  significance-testing  a not  valid 
assumptions.  Also,  this  method  allows  quicker 
significance-testing  in  most  cases,  though  it  is  not 
as  powerful  a test. 


Steps : 


(T) — Arrange  the  data  in  vertical  columns  (as  in  example) 
with  column  headings  as  follows.  Columns  4 through 


7 will  be  filled  in  later. 


Period 

Name 


Meas- 

ure- 

ment 


Difference 

Squared 

(R(xi)-R(yi)) 


Count  the  total  number  of  observations  or  measure- 
ments that  are  to  be  tested  for  trend.  Call  this 
number  "n". 

This  method  uses  the  ranks  of  both  the  time  periods 
and  the  measurements..  Since  the  time  periods  are 
recorded  in  their  natural  order,  the  period  numbers 
are  naturally  ranked  in  size  from  smallest  to 
largest.  Therefore,  for  column  4,  Ranks  of  x^,  just 
duplicate  the  entries  of  column  2. 

Rank  the  measurements  by  size.  That  is,  the  smallest 
number  in  column  3 will  receive  the  rank  of  "1"  and 
the  largest  the  rank  of  "n",  whatever  "n"  was  in 
Step  2.  Record  each  rank  in  column  5,  Rank  of  y^. 

If  there  are  tied  measurements,  both  (all)  of  them 
should  be  given  the  average  of  the  two  (all) 


successive  ranks  they  would  have  received  if  one 
were  bigger  than  the  other. 
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-For  each  measurement,  subtract  the  ranks  and  enter 
this  difference  (whether  positive  or  negative)  in 
column  6,  R(xi)-R(y^) . 

-Square  each  entry  in  column  6 and  erter  each  number 
in  column  7,  (R(x^)-R(y^)) 2 . 


-Add  up  the  values  in  column  7 and  record  this  sum, 

£ (R(x^)-R(y^))  ° , below  column  7 This  is  the  com- 
puted value  of  the  test-statistic  T (Computed) . 


-Choose  a significance  level  a.  A significance  level 
of  a=.05  or  e-.lO  is  usually  appropriate. 


-Form  the  null  hypothesis  H and  alternative  hypothesis 
H-^.  These  may  be  picked  from  the  three  choices  below. 


(A)  Ho: 
Hi: 

(B)  Hq: 

Hr 

<C)  H0: 


There  is  no  downward  trend. 

There  is  a significant  downward  trend. 


There  is  no  upward  trend. 

There  is  a significant  upward  trend 
in  the  data. 

There  is  no  significant  trend. 

There  is  either  an  upward  trend  or  a 
downward  trend  in  the  data. 


If  (A)  is  chosen,  continue  with  Step  10.  If  (B)  is 
chosen,  go  to  Step  13.  If  (C)  is  chosen,  go  to  Step 
16. 

(lO) — Enter  Table  IV,  page  142,  with  p=a,  chosen  in  Step  8, 
and  the  "n"  from  Step  2 to  find  the  table  value  of 
the  test-statistic.  That  is,  T (Table)  is  found  in 
the  p column  across  from  the  proper  value  of  n.  Also, 
the  value  1/3  n(n2-l)  will  be  needed  for  the  test. 

It  may  be  computed,  but  is  provided  in  the  far-right 
column  across  from  the  proper  value  of  n. 
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--If  T(Computed)  is  less  than  or  equal  to  l/3n(n2-l) 
minus  T (Table) , accept  the  null  hypothesis  H^:  There 
is  no  downward  trend.  That  is,  if  the  following  is 
true,  accept  Ho  at  the  a-level  of  significance. 

T (Computed)  5^-  n(n2-l)  - T (Table) 

--If  T (Computed)  is  greater  than  l/3n(n2-l)  minus 
T(Table),  reject  the  null  hypothesis  Hq  in  favor  of 
the  alternative  hypothesis  H^:  There  is  a significant 
downward  trend.  That  is , if  the  following  is  true, 
accept  H-^. 


T (Computed) n(n2-l)  - T (Table) 

This  completes  the  test  for  a downward  trend. 


•-Enter  Table  IV,  page  142,  with  p=a,  chosen  in  Step  8, 
and  the  "n"  from  Step  2 to  find  the  table  value  of 
the  test-statistic.  That  is,  T(Table)  is  found  in 
the  p=a  column  across  from  the  proper  value  of  n. 

•-If  T(Computed)  from  Step  7 is  greater  than  or  equal 
to  T(Table),  accept  the  null  hypothesis  Ho:  There 
is  no  upward  trend.  That  is,  if  T(Computed) >T (Table) , 
accept  Hq. 

•-If  T(Computed)  is  less  than  T(Table),  reject  the  null 
hypothes Is  HQ  in  favor  of  the  alternative  hypothesis 
H^:  There  is  an  upward  trend.  That  is,  if  T (Computed) 
<T(Table),  accept  H^.  This  completes  the  test  for 
an  upward  trend. 


Enter  Table  IV,  page  142,  with  p=a/2,  chosen  in 
Step  8,  and  the  n from  Step  2 to  find  the  table 
value  of  the  test-statistic  T(Table).  That  is, 
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T (Table)  is  found  in  the  p=q/2  column  across  from  the 
proper  value  of  n. 

(l7) — If  T(Computed)  from  Step  7 is  greater  than  or  equal 
to  T (Table)  and  less  than  or  equal  to  l/3n(n2-l) 
minus  T (Table) , accept  the  null  hypothesis  H&:  There 
is  no  significant  trend.  That  is,  if  the  following 
is  true,  accept  the  null  hypothesis  Hq. 

T (Tab le^T (Computed)  n(n2-l)  - T (Table) 

If  T(Computed)  is  less  than  T(Table)  or  greater  than 
l/3n(n2-l)-T(Table) , reject  the  null  hypothesis  II 
in  favor  of  the  alternative  hypothesis  There  is 
either  an  upward  or  downward  trend.  That  is,  if 
either  one  of  the  following  relationships  is  true, 
accept  H^. 


T (Computed) <T (Table) 
or  T (Computed) >j  n(n2-l)  - T (Table) 

This  completes  the  Spearman  Test  for  trend. 
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EXAMPLE  for  Spearmaii  Test  for  Trend  (Guidesheet  v'13) 


Steps : 

@ The  data  is  arranged  as  follows : 


Period  Period 
Name  Number 


2.5 

10 

1 


Difference  Difference  I 
^Squared  j 

R(xi)-R(yi)  (R<.x1)-R(yi)) 2 1 


64 

36 

16 

4 

0 

4 

20.25 

30.25 
1 

81 


nR(xi)-R(yi>) 

= 256.5 

We  will  use  all  data  for  the  test,  so  n=10 

Rewrite  the  numbers  in  column  #2  in  column  #4  a r, 
shown  above. 

The  smallest  val _e  of  y.  is  94.2.  It  receives  the 
rank  of  "1".  The  next  largest  value  yg=94.8  is  tied 
with  y 7=94. 8.  If  one  were  bigger  than  the  other, 
they  would  have  received  ranks  of  2 and  3.  We  will 
give  each  the  average  rank  of  2.5.  Rank  the  remain- 
der as  is  shown  in  the  table,  column  #5. 

Subtract  the  rank  9 from  the  rank  1.  Record  the 
difference  (-8)  in  column  #6  as  shown.  Do  this  for 
each  pair  of  measurements. 

The  square  of  -8  is  +64.  This  is  recorded  in  column 
#7.  The  square  of  -6  is  +36,  and  so  on  as  shown. 


(T) — Adding  the  numbers  in  column  #7  gives 


I 

-dH 


a 

«-  jfl 
oaT 


i 


> 1 1 

ssr  i 


M 


jfei  I 

I ^ ! 

m ' « i 


iw  y 

kU  « 


Lis 
1 .V 


®- 

©■■ 


JfcCxp-RCyp)1  - 256.5 

as  recorded  below  column  #7.  T (computed) =256. 5 

--We  will  use  the  a=.05  level  of  significance. 

--We  are  interested  primarily  in  the  presence  of  a 
downward  trend,  so  (A)  is  chosen. 

--Go  to  Table  IV,  page  142.  We  look  in  the  p=.050 
column  and  across  from  n=10  and  find  T(Table)=74. 
The  value  for  l/3n(n2-l)=?30  (far  right  column) 


n 

p = .001 

.005 

.010 

.025  * 

.050 

.100 

£«(«*- 

1) 

4 

2 

2 

20 

5 

2 • 

2 

4 

6 

40 

6 

2 

4 

6 

8 

14 

70 

7 

2 

6 

8 

14 

18 

26 

112 

8 

6 

12  ' 

16 

24 

32 

42- 

168 

. 9 

12 

22 

28 

38 

50 

64 

240 

^10 

22 

36 

44 

60 

© 

92 

(330) 

T (Computed)  = 

■256.5 

is  not 

less 

than 

or 

equal 

to 

l/3n(n2-i)-T(Table)  so  we  do  not  accept  Hq.  Go  to 
Step  13.  That  is  256 . 5<330-74=256  is  not  true. 

--T(Computed)=256. 5 i£  greater  than  l/3n(n2-l)-T(Table) 
so  we  re.ject.  Ho  in  favor  of  H^:  There  is  a signifi- 
cant downward  trend  at  a=.05.  That  is.  since 

?.56.5>330-74  = 256 

accept  H-^. 

If  (B)  were  chosen,  we  would  not  need  the  value  of 
l/3n(n2-l).  The  value  T(Computed)=256.5  would  be 
tested, against  T(Table)=256.  Since  256.5>256,  the 
null  hypothesis  (Hq:  No  upward  trend)  would  be 
accepted. 

If  (G)  were  chosen,  we  would  have  to  use  the  p=a/2 
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=, 05/2=. 025  column  across  from  n=10  to  get  T (Table) 
=60.  We  must  also  use  l/3n(n2-l)=330  from  the  far 
right  column. 

Is  T (Table) 5T (Computed) 1-4-  n(n2-l)~T(Table)  true? 

? ? 

We  have:  605256. 55330-60  = 270 
Yes,  this  is.  true. 

We  must  now  accept  Hq:  There  is  no  significant  trend 
at  the  a=.05  level.  When  testing  for  both  kinds 
of  trend,  the  data  must  exhibit  a more  severe  trend 
before  the  null  hypothesis  is  rejected,  unlike  when 
testing  for  one  kind  of  trend  (Step  12) . 
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Table  IV 


Critical  Values  for  Spearman  Test 

(Ref  1:389) 


n 

p = .001 

.005 

.010 

.025 

.050 

.100 

4 

2 

2 

20 

5 

2 ■ 

2 

4 

6 

40 

6 

2 

4 

6 

8 

14 

70 

n 

$ 

2 

6 

8 

14 

18 

26 

112 

8 

6 

12 

16 

24 

32 

42 

168 

9 

12 

22 

28 

38 

50 

64 

240 

10 

22 

36 

44 

60 

74 

92 

330 

11 

36 

56 

66 

86 

104 

128 

440 

12 

52 

78 

94 

120 

144 

172 

572 

13 

76 

110 

130 

162 

190 

226 

728 

14 

106 

148 

172 

212 

246 

290 

910 

15 

142 

194 

224 

270 

312 

364 

1120 

16 

186 

250 

284 

340 

390 

450 

1360 

, 17 

238 

314 

356 

420 

480 

550 

1632 

18 

300 

390 

438 

512 

582 

664 

1938 

19 

372 

476 

532 

618 

696 

790 

2280 

20 

454 

574 

638 

738 

826 

934 

2660 

21 

540 

686 

758 

870 

972 

1092 

3080 

22 

652 

810 

892 

1020 

1134 

1270 

3542 

23 

111 

950 

1042 

1184 

1312 

1464 

4048 

24 

904 

1104 

1208 

1366 

1510 

1678 

4600 

25 

1050 

1274 

1390 

1566 

1726 

1912 

5200 

26 

1212 

1462 

1590 

1786 

1960 

2168 

5850 

27 

1390 

1666 

1808 

2024 

2216 

2444 

6552 

28 

1586 

1890 

2046 

2284 

2494 

2744 

7308 

29 

1800 

2134 

2306 

2564 

2796 

3068 

8120 

30 

2032 

2398 

2584 

2868 

3120 

3416 

8990 

ri 


V.  Conclusions  and  Recommendations 


Chapter  IV  may  be  extracted  and  used  as  a guidebook 
with  a minimum  of  changes,  such  as  the  page  numbers.  This 
chapter  describes  the  conclusions  reached  as  a result  of 
the  research  effort  and  offers  some  recommendations  for  the 
use  of  the  guide  and  for  further  study  and  development. 


Conclusions 

This  research  effort  has  produced  a number  of  methods 
by  which  a trend  in  measurements  over  time  may  be  analyzed. 
As  mentioned  in  the  first  chapter  (page  4) , an  early 
conclusion  was  that  a guide  was  needed  so  these  methods 
could  be  applied  by  those  with  little  mathematical  back- 
ground. While  the  guide  gives  step-by-step  procedures,  it 
does  not  relieve  the  user  of  all  subjective  judgement.  The 
variety  of  methods  available,  the  several  choices  of  signif- 
icance levels,  plus  the  choices  as  to  how  far  to  project 
trends  all  give  the  Trend  Analysis  (TA)  officer  flexibility 
to  do  what  he  feels  most  appropriate,  according  to  exper- 
ience. It  is  concluded,  then,  that  if  nothing  else,  the 
guide  provides  quantification  to  the  decision  made  by  the 
TA  officer,  and  he  can  show  (with  numbers)  the  decision- 
maker (the  Commander,  usually)  the  effects  of  the  choices 
made  during  trend-method  application.  That  is,  he  could 
say  that  a time-plot  suggested  a curved-line  trend,  and 
at  a .05  level  of  significance,  method  #6  (Guidesheet  #6) 
shows  a significant  departure  from  the  standard  will  occur 
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in  two  time-periods.  If  the  decision-maker  does  not  accept 
the  curved-line,  the  significance-level,  or  the  number  of 
time-periods  forecasted,  he  and  the  TA  officer  still  have 
a quantifiable  basis  upon  which  to  argue.  Alternative 
spproaches  would  be  easily  discussed  at  this  point.  The  TA 
officer’s  expertise  (or  decision-maker’s)  can  then  be 
applied  to  reversing  the  trend  as  appropriate.  The  guide 
can  then  be  used  to  evaluate  the  effects  of  policy  changes. 

Recommendations 

While  this  research  effort  discussed  eight  different 
methods  to  evaluate  the  significance  of  trends  and  one 
method  to  evaluate  correlation,  time  factors  prevented  the 
description  of  other  useful  methods.  Alternate  methods  of 
correlation  were  to  be  described  but  there  was  no  time. 
However,  if  the  user  goes  to  the  body  of  the  thesis  (chapter 
3) , he  will  see  at  least  a minimal  description  of  a non- 
parametric  test  for  correlation.  It  uses  the  Cox  and 
Stuart  test  for  trend  and  refers  to  a bibliographical 
reference  for  a more  complete  discussion.  Incidentally, 
that  reference  may  be  reasonably  well  understood  by  the 
non-mathematician . 

Another  recommendation  for  further  development  is  the 
use  of  the  Cox  and  Stuart  test  for  trend  (Guidesheet  #11) 
for  cycling  data.  The  writer  first  wished  to  address 
methods  that  could  be  used  when  there  were  a limited  number 
of  data  points.  Usually  cycling  data  does  not  show  up  until 
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measurements  extend  over  at  least  a couple  years.  If  is 
recommended,  then,  that  the  problem  of  cycling  data  be 
addressed,  either  by  the  Cox  and  Stuart  method  or  others. 

As  a start,  cycling  data  problems  are  addressed  in  Refer- 
ence 1,  page  134.  Also,  other  non-parametric  correlation 
tests  are  discussed  in  this  reference. 

If  this  guide  is  well  received,  it  may  be  quite  valu- 
able to  extend  the  regression  method  (Guidesheet  #5)  to 
multiple  linear  regression  analysis.  This  could  be  applied 
to  such  problems  as  determining  what  factors  affect  bombing 
CEPs,  for  instance.  One  interviewee  actually  mentioned 
that  he  wished  he  had  a method  by  which  he  could  evaluate 
what  had  the  greatest  impact  on  mission  success:  pilot 
proficiency,  navigator  proficiency,  weather,  equipment 
malfunction,  or  aircraft  flown.  Multiple  linear  regression 
techniques  are  ideally  suited  to  investigate  such  issues. 

Some  of  the  statistical  packages  available  for  compu- 
ter use  may  be  a rich  area  for  development  of  additional 
guides.  As  computer  access  becomes  easier,  a TA  officer 
may  be  able  to  apply  rather  sophisticated  tests  like  those 
available  in  SPSS  (Statistical  Package  for  the  Social 
Sciences)  if  he  had  a step-by-step  guide  to  help  him. 

These  recommendations  conclude  the  presentation  and 
discussion  of  this  research  effort.  The  objectives  have 
been  achieved,  and  the  writer  has  gained  a better  feel  for 
many  statistical  methods  plus  an  appreciation  for  the  prob- 
lems a TA  officer  has.  It  is  hoped  the  effort  put  into 
this  thesis  will  prove  useful  to  others. 
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This  is  used  in  the  Logistics  School  at  AFIT  and 
is  fairly  simply  written.  A lot  of  mathematics  is  not 
required,  but  it  is  a bit  more  rigourous  than  any  of 
the  above . 
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